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METHOD AND APPARATUS FOR DELIVERY OF 
TARGETED VIDEO PROGRAMMING 

FIELD OF THE INVENTION 

5 This invention relates generally to the delivery of targeted video programming to a 

viewer, and more- particularly to the determination of various viewer characteristics for the linear 
delivery of video programming targeted to the viewer's characteristics to create a targeted or 
customized, apparently linear television program. 

1 0 BACKGROUND OF THE INVENTION 

Currently, recording of television programs by individuals for viewing at a later time is 
generally performed using commercially available Video Cassette Recorders (VCRs). Typically, 
a VCR may be either manually placed into a record mode or may be programmed to record a 
selected program at a later time. To program the VCR, the user either enters a date, time and 
1 5 channel of .the program desired to be recorded, or enters an identification code of the desired 
program. 

Viewers of television programming increasingly have more choices as to which 
programs to view. For example, cable television provides a dramatic increase in the number of 
channels available to a viewer in comparison to the channels available by way of a conventional 

20 television antenna. Digital satellite systems provide even more viewing choices. Digital 

broadcast of programs over cable television systems is expected to further increase the number 
of channels available to viewers. 

One effect of the increase in the number of viewing choices is increased difficulty in 
deciding which programs to watch. People,. particularly those with busy schedules, may not have 

25 the time to select and view programs to determine which programs they may or may not like. 
Programs that may otherwise be desirable to a viewer may never be watched if the program is 
broadcast at a time that is inconvenient for the viewer. Users may select certain programs for 
viewing to determine if they like the program. However, with several hundred program 
selections each week, this task can take a considerable amount of time and is likely to cause 

30 certain desirable programs to be overlooked. 

There are companies, who sample television-viewing habits of the population by 
monitoring the programs watched by a very small set of TV viewers. These companies collect 
other demographic information also about the people they monitor for the generation of the 
sample. These samples give valuable information about the television viewing habits of the 
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population covered by the sample. By analyzing these samples periodically a general 
mathematical model can be formed which explains the behavioral patterns of the population in 
television viewing. Each individual viewer would have a very personal liking for television 
programs which.may vary considerably from a model derived from the sample behaviors of 
5 general population. At the same time a mathematical model derived by monitoring the behavior 
of a single user may be inaccurate because of the limited amount of information which can be 
gathered by watching only a single user for a short period of time. Sending the entire sample of 
behavioral pattern of the sample population to every viewer device to aid computation of the ' 
mathematical model is counter-productive because of the high cost of bandwidth required to 
10 transmit this information to each device and the processing power and memory requirement for 
the viewer device to process this information. Sending personal viewing habits of every user to a 
server to compute the mathematical model for the, individual user would raise privacy concerns 
for the viewer and also requires a return channel from the viewer device to the server. 
With a mechanism to automatically determine personal preferences of a viewer 
1 5 accurately, a very personal TV viewing environment can be presented to the viewer. In case of • 
households with multiple members, by correctly identifying individual members and their 
preferences, an apparatus can provide an entertainment experience which is most pleasurable to 
the individual viewer. 

Methods have been developed for providing text data to viewers. A closed captioned 
20 encoding technique transmits text data in synchronization with its associated video data by 
inserting the closed captioned text data into a vertical blanking interval of the video signal. 
However, the closed captioned text data must be inserted into the vertical blanking interval of 
the video signal by the producer of the video programming. As a result, the vertical blanking 
interval of the video signal cannot be used by the head end operator to insert other text data such 
25 \ as spbrtsl Weather, stock market, news, advertising and other data. 

£ ; Electronic program guides (hereafter "EPG") provides viewers with on-screen listings of 
upcoming television programs on cable television channels. The EPG is provided by an EPG 
data service. EPG data is converted into a video signal at the cable head. The EPG data is 
converted into a video signal at the cable head and transmitted to the viewer's television by a 
30 dedicated cable television channel. After tuning to the dedicated cable television channel, the 

viewer then wait for the programming for the desired time period is displayed. Often, when EPG 
data is used, the cable head end operator must dedicate a separate cable television channel to the 
EPG data and create video signals from the EPG data that are provided by the EPG service 
providers. 
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One method of solving this problem is modulation of the EPG data onto an FM carrier 
and transmitting the FM carrier with a video signal on one of the cable television channels. A 
dedicated peripheral device is provided at the viewer's television tuner that demodulates the EPG 
data from the FM carrier. The EPG data is then stored until the viewer requests presentation of 

5 the EPG data on the viewer's television. Upon selection, the EPG data is then displayed on the 
viewer's television in place of the other video programming. 

A data controller is disclosed in U.S. Patent No. 5,579,055 that manages the flow of text 
data and electronic EPG data. 

Additionally, preference agents for television programs have used Bayesian methods. 

10 See for example U.S. Patent No. 5,704,017 (hereafter the m 017 Patent"). However, in the '017 
Patent, a collaborative filtering system is used to predict a desired preference of a television 
viewer based on attributes of the viewer. A system implementing '017 would need to 
communicate to a belief network through a two way communication network, disclosing private 
viewer information to the network. '017 does not leverage the rich EPG information available 

1 5 about television programs which can be used to identify various traits which contribute to 
viewer's choices. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a method and apparatus to select and 
deliver video data targeted to a specific viewer, typically a television viewer. The video data 
20 may be targeted in accordance with various characteristics of the viewer, including viewing 
characteristics, demographic characteristics, shopping characteristics, and others. 

Accordingly, an object of the present invention is to provide a method and apparatus for 
determining user preferences for television viewers. The present invention defines a method to 
analyze sample viewing habits along with associated demographic and other characteristics of 
25 the general population to generate parameters of a mathematical model that explains the 

relationship between the viewing patterns of the population and the associated characteristics, 
and that can be communicated to viewer devices using a one way communication channel such 
as a broadcast network. The present invention also defines methods for using these parameters 
by viewer devices to compute the user preferences of individual viewers without divulging 
30 private information about viewing habits to the outside world. 

The invention also details an apparatus that relies upon a viewer's computed preferences 
to store and present broadcast content that may be of interest to one or more individual viewers 
utilizing the same apparatus to view video data. Thus, the invention also discloses a method to 
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compute the preferences of multiple viewers accessing the same viewing device, e.g. a multiple 
viewer household with a single television set. 

In another aspect, the present invention determines viewing preferences of a user by 
monitoring programs viewed by the user and causes recording of programs corresponding to the 
user's preferences. In accordance with the principles of the present invention, apparatus for 
causing recordation of television programs comprises a preference agent for causing retrieval of 
attribute information corresponding to each television program viewed by a user of the 
apparatus. The preference agent generates classification information indicative of viewing 
preferences of the user as a function of the attribute information. A recording manager causes 
recordation and storage to a storage device of television programs having attribute information 
that matches the classification information. 

In a further aspect, the invention determines viewing preferences of a user by processing 
attribute information associated with programs viewed by the user and creating a profile of 
characteristics describing the viewer. The characteristics may include, but are not limited to, 
viewing preferences, demographic information (e.g. age, sex, education, occupation, income, , 
political and religious affiliations, marital status, sexual preferences, ethnic background, 
geographical location of home and work), reading preferences, shopping preferences, music 
preferences 

Embodiments employing the principles of the present invention advantageously cause 
recordation of programs that match certain viewing habits of the viewer. Such embodiments 
therefore provide the viewer with stored programs that match certain viewing preferences of the 
user, which can be viewed at the viewer's leisure. The viewer is therefore relieved of the burden 
of deciding which programs from among several hundred possible programs to watch. 

In accordance with a further aspect of the present invention, programs may be recorded 
for storage in accordance with available capacity of the storage device. Moreover, programs 
may be deleted in response to selections by the user or based upon a priority, indicated by 
viewing preferences of the user, in which programs having lowest priority are deleted first to 
make room for newly recorded programs. The priority of programs may also be a function of 
time, in which more recently recorded programs are given higher priority. 

In accordance with further aspects of the invention, determining which programs to 

record may also be a function of priority in which program s specified for recordation are given 

highest priority, followed by programs having attribute information corresponding to one or 

more user specified criteria, then followed by programs having attribute information 

corresponding to the recordation preference information. 
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In accordance with further aspects of the invention, the user specified requests may be in 
the form of a first type of request comprising information indicative of a specific program and a 
second type of request comprising specifications indicative of one or more programs having 
attribute information corresponding to the user's specifications. 

5 In accordance with further aspects of the invention, the user may cause recordation of a 

currently broadcast program being viewed by the user by causing generation of a pause input. 
This advantageously allows a user to interrupt viewing of a currently broadcast program by 
recording the remainder of the program for subsequent viewing. Program viewing options may 
be presented to the user in the form of a menu that provides an easy to use interface for selection 

10 of programs and viewing and other options including play, pause, delete, fast-forward, rewind 
and so forth. 

Preferably, the preference agent organizes the recordation preference information in the 
form of a database organized in accordance with categorization parameters. Programs may be 
received On either analog or digital formats. Programs stored in digital format are 
15 advantageously presented to the user in the form of additional channels. This allows the user to 
easily switch between programs (either recorded or broadcast) simply by switching channels. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a high-level block diagram of a Program selection device employing the 
20 principles of the present invention. 

Figure 2 is a high-level block diagram of a system employing the principles of the 
present invention. 

Figure 3 shows two examples for Program information. 
Figure 4 shows examples for traits and Liking values. 
25 Figures 5a-b are flowcharts illustrating the Data analysis performed on a representative 

sample. 

Figure 6 illustrate the process of computing the error in prediction of user choices 
Figure 7 illustrate a step in the process of regression analysis 
Figure 8 shows the relationship between two correlated traits. 
30 Figures 9a-c illustrate the process of determining trait-ness of a trait in a program. 

Figure 10 is a block diagram-showing an example for the Liking distribution record 

format. 

Figure 1 1 lists some sample values for different fields described in Figure 10. 

Figure 12 is a block diagram showing an example for the trait-ness record format. 
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Figure 13 shows an example for broadcasting trait-ness as a part of EPG data 
Figure 14 shows an example for user selection record format. 

Figure 15 is a block diagram showing the inputs and output for the generation of user 
selection history 

5 Figure 16 is a flowchart illustrating the process of learning the liking values, performed 

on a program selection device. 

Figure 17(a) is a block diagram showing the inputs and output for the computation of ■.. 
relevancy value for selection history record. 

Figure 17(b) is a graph representative of relevancy over age. 
1 0 Figure 1 7(c) is a graph representative of repetition rate over relevancy. 

Figure 18(a) is a block diagram showing the inputs and output for the process of 
updating of past selection history 

Figure 18(b) is a flowchart illustrating the process of updating of past selection history 
Figure 19 is a flowchart illustrating the process of computing liking values performed on 
15 a program selection device. 

Figure 20 is a block diagram showing the inputs and output for the computation Liking 
for programs. 

Figure 21a illustrates distribution of income for different programs. 

Figure 21b illustrates distribution of gender for different programs. 
20 Figure 22 is a system architecture for providing targeted advertising. 

Figure 23a is a graph illustrating the relationship between Belief function and probability 
for a user belonging to particular demographic trait value. 

Figure 23b is a flow chart of a demographic trait record format. 

Figure 23c is a flow chart of an advertisement targeting record format. 
25 Figures 24 and 25 are block diagrams illustrating operation of certain functions 

performed by the television recording system of Figure 1 . 

Figures 26(a)-b illustrate alternative hardware configurations in systems embodying the 
principles of the present invention. 

Figure 27 is a flowchart illustrating additional aspects of operation of the television 
30 recording system of Figure 1 . 

Figure 28 is a block diagram of one embodiment of a system for providing EPG data and 
text data to a viewer. 
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Figure 29 is a block diagram that illustrates one embodiment of a data controller for : 
receiving EPG data and text from data providers, formatting the data for display and inserting 
the data into a vertical blanking interval of a cable television channel. 

Figure 30 illustrates one embodiment of an information field of the ERG data read from 
5 the EPG database of Figure 29. 

Figure 3 1 illustrates a data format of data read from the database for insertion into an {. 
assigned cable television channel. 

Figure 32 is a flow chart illustrating the operation of the EPG transaction formatter of 
Figure 29. 

10 Figure 33 is a flow chart illustrating one embodiment of the operation of the text ; 

transaction formatters of Figure 29. > 
Figure 34 illustrates one embodiment of a set top box for use in receiving text data and j 
EPG data. 

Figures 35-41 illustrate various aspects of the process of creating and using multiple 
1 5 viewer profiles according to the present invention. 

Figure 42 illustrates a method for distributing targeted electronic content without 
compromising the privacy of users. 

Figure 43 illustrates an embodiment of a system according to the present invention. 
Figure 44 illustrates custom linear programming according to the present invention. 

20 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In Figure 1, a television control system 100 operates in accordance with the principles of 
the present invention to cause recordation of television programs in response to user inputs 102 
anii television signals 104. Television control system 100 transmits signals to a television 

25 monitor 108 for viewing by the user. Preferably, in digital embodiments, programs that are { 
recorded by system 100 are presented to the user in the form of additional channels. Thus, the 
user can rapidly determine, by changing channels, the stored programs that are available for 
viewing. The user can also change channels between stored programs or between stored 
programs and currently broadcast programs. If the user changes channels from a recorded 

30 program to another program, playback of the recorded program is preferably paused. 

. Alternatively, whether the playback of the recorded program is paused or continued, is a user 

selectable option. As described further herein, the user may specify programs for recordation by 

specification of a particular program, or by specification of particular attributes of the program 

such as comedy/drama, actor(s). When manually specifying programs for recordation, the user 
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may specify that the program is to be recorded once or repeatedly, such as weekly, when 
broadcast. 

Signals 104 include a first component 105 which contains the information necessary to 
display video and audio components of a television program on television monitor 108. Signals 
5 104 preferably also include a second component 107 termed herein "attribute information." An 
example of such attribute information 107 is the information available by way of the DVB-SI 
and ATSC-SI formats and various proprietary formats such as StarSight EPG Data and TVData 
available from StarSight Telecast, Inc., Fremont, CA, and TVData, Glen Falls, NY, respectively. 
Attribute information 107 for any particular program varies depending on the program 

10 type, but typically includes a plurality of categories such as start time for the program, duration 
of the program, the title of the program and other, attributes (categories) of the program, together 
with an associated value corresponding to each of the categories. Preference agent 1 1 0 processes 
the attribute information 107 to generate if "category- value" pairs 1 15. For example, if an 
attribute for a program is duration, then the category may be duration and the value for that 

1 5 category may be 120 minutes. If the attribute for a program is title, then the category may be 
title and the value may be "Star Wars." Other category-value pairs for a movie may include a 
description category with a short description of the movie being the value, a primary actor 
category with the names of the primary stars of the movie being the values, a director category 
with the name of the director being the value, a theme category with the theme such as 

20 adventure, comedy being the value, and a ratings category with ratings by particular critics being 
the value. Category-value pairs for a sports game, such as a football game, may include names 
of the teams who are playing, the location of the game, and the specific tournament, such as the 
play-offs, or Superbowl, etc. 

The category-value pairs 1 15 (preference information) are indicative pf viewing 

25 preferences of the user. The data shown in Figure 1 as being associated with the category - 

value pairs 115 contains weighting information for the associated category value, in addition to 
other information shown by way of example further below. Preference agent 110 maintains the 
preference information 1 15 in the form of a preference database 1 16. Television programs 105 
recorded by the system 100 are preferably stored separately together with the associated attribute 

30 information 107. In an alternative embodiment, the category value pairs 1 1 5 (with or without 

the associated values) are stored with the television programs 1 05 and the raw attribute 

information 107 is not maintained by the system 100. 

Preference agent 1 10 generates, in response to user viewing habits, data for each 

category stored in preference database 1 16 and for each value of each category. The data 
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generated by preference agent 1 10 for each category and value is preferably indicative of the 
amount of time that the particular category and/or value is watched by the user relative to the 
total amount of time that the particular category and/or value is available to be watched. The 
relative amount -of time that a program is watched by a user is a convenient indication of the 
5 user's relative viewing preference. However, other indications of user viewing preferences may 
also be used. Program source switch 114 operates in response to user inputs 102 to select either 
presently broadcast programs, by way of television signal 104 or stored programs from storage 
devices 106. 

Recording manager 1 12 operates to cause recordation and storage of television programs 
10 105 and attribute information 107 in accordance with information generated by preference agent 
1 10 and stored in preference database 116. Recording manager 1 12 also responds to user 
requests to record particular programs and to user requests to record programs having specified 
category- value pairs. 

The signals transmitted to the monitor 108 preferably take a conventional analog form. 

15 Alternatively the signals transmitted to the monitor 108 maybe digitally encoded. The exact 

form of the signals transmitted to the monitor is not critical and may take a form as required by a 
particular monitor. The television signals 104 received by the television control system 100 may 
take one of a variety of signal formats including analog encoded signals that are encoded in 
accordance with the well known NTSC or PAL standards. Alternatively, the signals 104 may be 

20 digitally encoded in a manner as transmitted by commercially available digital satellite systems 
(DSS) or in accordance with the MPEG-2 (Motion Picture Expert Group - 2) standard. In any 
given embodiment of television control system 100 the signal 104 may take a variety of the 
aforementioned forms. For example, television control system 100 may be coupled to receive 
inputs from a digital satellite system, the inputs being digitally encoded. The television control 

25 system 100 may also be coupled to receive inputs from a Community Antenna Television 

System (CATV) in which the signals are either encoded in analog or digital form. The television 
control system 100 may also be coupled to receive analog or digital signals from a conventional 
home antenna. 

The attribute information 107 may be transmitted to the television control system 100 

30 contemporaneously with the television program 105 in a variety of ways including industry 

standards, such as DVB-SI (Digital Video Broadcasting-Service Information) as defined by the 

European Telecommunication Standards Institute (ETS), or the ATSC digital television standard 

as defined by the Advanced Television System Committee (ATSC). By way of example, in the 

DVB-SI protocol, programming for the next six hours is transmitted every eight seconds for 
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each channel.. As a further example, program information for the next seven days is available 
from the interactive on-screen TV program guide available from StarSight Telecast, Inc. 
Programming information further into the future, such as for the next seven days, may also be 
obtained in other ways. For example, by receiving the information in a time-multiplexed manner 
5 over a particular channel. Such information can easily be transmitted when the user is 

performing an action that does not require a moving video image on the screen, such as when 
the user has a control menu displayed on the screen. 

Alternatively, television control system 100 can download the attribute information 107 
separately from the television program 105 by way of a separate communication session via a 
10 modem or the Vertical Blanking Intervals (VBI) contained in television signals. Such separate 
communication sessions include data download mechanisms supported by the MPEG-2, DVB- 
SI and DSS protocols. 

The attribute information 107 can take a form under the DVB-SI protocol as shown 

below: 
15 event_id, 

start_time. 

duration, 

DESCRIPTORS 

DESCRIPTOR^ 
20 ... 

DESCRIPTORS 

The event-id field Is a unique alpha-numeric code assigned to a program. DESCRIPTORS can 
be "Short Event Descriptors," "Extended Event Descriptors" or "Content Descriptors" which 
includie tKfe following information: 
25 Short Event Descriptor: 

{ 

eventjiame-length 
event_name, 
event_description-length 
30 event_description 
} 

Extended-even 

{ 

ITEMl 
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ITEM2 
ITEMn. 

5 } 

content descriptor: 

{ 

CONTENT 1, 
CONTENT2, 

10 

CONTENTn. 

} 

ITEMs include the following information: 

item_description_length, 
item_description, 
item_value_length, 
item_value 

20 } 

An example of item descriptions can be "Director" and item value can be "Martin 
Scorsese". CONTENT includes the following information: 

{ 

DVB-SI defined theme, 

25 DVB-SI defined sub-theme, programmer defined theme, programmer defined 

subtheme, 

} 

An example of theme and subtheme are MOVIE and COMEDY, respectively. The 
programmer defined theme and sub-theme are values that may be provided by the EPG Data 
30 provider. 

Category- value pairs 1 1 5 are generated from the above type of information. The 

category- value pairs 115 take the following format: Category Name - Category Value, where 

category name cart be "Title", "Director", "Theme", "Program Type" etc. and category values 

can be "Seinfield" "Martin Scorcese", "Comedy", "Sitcom" etc. Generation of category-value 
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pairs 115 from attribute information 107 allows generation by preference agent 1 10 of categories 
that are not explicitly present in the attribute information 107. For example, category- value pads 
1 15 can be: Title-49ers, Description-football, and Description Search Rule-football(AND) San 
Francisco. Thus; preference agent 1 10 is capable of generating category- value j>airs 115 from 
5 attribute information 1 07 even where there is no field in the attribute information that 
corresponds to the created category-value pair. 

Preference database 1 16 is preferably generated initially by downloading category- value 
pairs from a third-party source such as StarSight Telecast, Inc. Advantageously, such sources 
may provide information customized for particular geographical areas and dates. For example, 

10 the database may contain data that gives sporting events involving local teams higher ratings 

than other sporting events. In addition, seasonal or holiday programs may be indicated as being 
preferred during particular seasons or holidays. For example, programs involving summertime 
activities would be indicated as having higher weighting during the summer than at other times 
of the year. The preference database is modified as described herein in accordance with the 

1 5 user's viewing habits. In addition, the preference database can be periodically updated from 
third-party sources to reflect the aforementioned seasonal or holiday updates. 

Categories in the preference database 1 16 are either predefined, such as those received 
from third-party sources, or are dynamically created from attribute information 107 received for 
programs 105. Categories, and associated values, that are dynamically created are preferably 

20 given a default rating by preference database 116. An example of the preference information 
created by preference agent 1 10 or downloaded to preference agent 1 10 is shown below. In the 
following example, the three columns of numbers in the category statistics and value statistics 
portions indicate weighting (in a range of 0 to 1000) duration watched (in seconds) and amount 
of time that programs matching that particular category or value was available (in seconds). The 

25 information is preferably stored in the form of database records. 



Categories: 

channel 1000 

title 1001 

title-Substring 1002 

30 genrelnfo 1003 

description 1 004 

descSubString 1005 

episodeName 1 007 

type 1008 

35 stars 1009 

director 1010 

YearProduced 1011 
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MPAARATING 1012 
criticRating 1013 
Values: 

Titanic 2000 

5 Ami 2001 

3rd Rock From the Sun 2002 
The Gods Must Be Crazy 2003 

Seinfeld 2004 

Headline News 2005 

10 Bugs & Daffy 2006 

News 2007 

004 2008 

005 2009 
063 2010 

15 49ers 2011 

SITCOM 2012 

COMEDY 2013 

MOVIE 2014 

NEWS 2015 

20 Sanfrancisco 49ers 2016 
A Coke bottle raises 
havoc for a tribe of 

African bushmen 2017 

John Mayers 2018 

25 Lousie Barnett 2019 

Marius Weyers 2020 

Sandra Prinsloo 2021 

Jeff Bridges 2022 

Valerie Perrine 2023 

30 Phil Hartman 2024 

Jamie Uys 2025 

Lamont Johnson 2026 

1981 2027 

1973 2028 

35 1996 2029 

THREESTAR 2030 

TWOSTAR 2031 

NUDITY 2032 

VIOLENCE 2033 

40 ADULTS ITUATIONS 2034 

ATULTLANGUAGE 2035 



Category - Value pairs: 





1001 


2001 




1001 


2002 


45 


1001 


2003 




1001 


2004 




1001 


2005 




1001 


2008 




1000 


2009 


50 


1000 


2010 
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l fifto 901 1 












1003 2013 






1003 2014 






Catefforv statistics* 






1001 




1000 31104 4022280 


1002 




1000 31104 4022280 


1003 




1000 31104 2613384 


1004 




1000 20304 1996596 


1005 




1000 20304 1996596 


1006 




1000 5238 1259028 


1007 




1000 3438 369450 


1008 




1000 13266 812970 


Value Statistics 






2001 


1000 


1638 88074 


2002 


1000 


6714 178560 


2003 


1000 


6552 387054 


2004 


1000 


5400 165600 


2005 


1000 


1600 9000 


2006 


1000 


3600 28800 


2011 


500 


1800 10800 



In the above example, fourteen categories are provided (1000 - 1013) followed by thirty- 
six values. The correspondence between the categories and values (category - value pairs) is 
25 next shown. Data for the categories and then the values is shown next. This data is organized in 
three columns as described above. 

In the embodiment shown in Figure 1 and described above, the ratings for categories and 
values are dynamically generated by the preference agent 110 instead of being stored in 
preference database 116. In an alternative embodiment, the ratings may be stored in preference 
30 database together with the category-value pairs. 

Referring now to Figure 2, preference determination is used to predict a user's 
preferences in the choice of TV programs. This prediction is based on, (i) analy sis of the 
individual users viewing habits, (ii) analysis of the viewing habits of a representative sample of 
users arid (iii) EPG data for the television programs available during the period of the collected 
35 sample. 

The viewing habits of a user are described below in the context of actual programs 
watched by the user. However, other information may be just as useful for the purposes of the 
invention, and includes information regarding programs that are never watched by the user or 
watched only very briefly before changing to another channel, programs that the user schedules 
40 to record, programs that the user records but does not watch or watches only briefly, and 

programs for which the user requested information from a program guide (e.g. an on-screen EPG 
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menu) or for which the user never requests information or for which the user requests 
information but does not choose to watch. 

One method for creating a preference determination is through the use of a Bayesian 
network that is used to predict user behavior, and hence a preference agent, is. a knowledge 
5 based approach. In this method a knowledge engineer interviews an expert about the Field of 
expertise of the expert. The expert and the knowledge engineer determine the distinction of the 
worlds that are important for decision making in the field of the expert. The knowledge engineer 
and the expert next determine the dependencies among the variables and the probability 
distributions that quantify the strengths of the dependencies. 

10 A second method is Bayesian network is a data based approach. In this second method 

the knowledge engineer and the expert first determine the variables of the domain. Data is then 
accumulated for those variables, and an algorithm applied that creates the belief network to 
predict user behaviors from this data. The accumulated data comes from real world instances of 
the domain. This second approach exists for domains containing only discrete variables. 

15 Bayesian methods can be performed by utilizing an algorithm known as the "EM" 

algorithm, which, as recognized by those skilled in the art, calculates the maximum a posteriori 
values ("MAP values") for the parameters of the model to predict user behavior. The EM 
algorithm is described in Dempster. Maximum Likelihood From Incomplete Data Via The EM 
Algorithm, Journal of the Royal Statistical Society B, Volume 39 (1977), incorporated herein by 

20 reference. 

After calculating the probability for each variable, each variable within the belief 
network is then scored for goodness in predicting the preferences of the user by rendering a 
subscore. As recognized by those skilled in the art, the subscores are generated by using the 
marginal likelihood of the expected data at the maximum a posteriori values. 

25 One skilled in the art will recognize that any standard model to predict user behavior 

inference algorithm can be used to perform this step, such as the one described in Jensen, 
Lauritzen and Olesen, Bayesian Updating in Recursive Graphical Models by Local 
Computations, Technical Report R-89-15, Institute of Electronic Systems, Aalborg University, 
Denmark, incorporated herein by reference. 

30 Television viewing habits of the population are sampled to generate representative 

samples of viewer behaviors, 1 17. This task is typically performed by companies such as 

Nielsen Media Research, who collects user behavioral samples to conduct market research. The 

results of the sampling is stored in a viewer behavior database. The format of the viewer 

behavior database is proprietary to the company conducting the sampling. Viewer behavior 
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database contains demographic information about the people in households participating in the 
sampling. This information includes but is not limited to race, age, annual income and gender. 
Viewer behavior database also contains information about all the television programs each 
viewer watched- during the period of sampling. 
5 Program Information and the schedule of television programs is available in the EPG 

database, 104. Program Information for a television program contains information about various 
attributes of the program which includes but not limited to the title, program type and program 
category of the television program, arid also the actors acting in the television program. 

Figure 3 lists two examples of program information, example 1, 124, describes the 

10 program information for a audio visual television program, and example 2, 125 describes 
program information for a graphical text based television program. 

Viewer behavioral database is analyzed 1 1 8 to identify traits of TV programs and 
determine how such identified traits influence viewing habits in the representative sample of 
users. These traits and their influence amongst the viewing population are then used in aiding 

15 the process of predicting an individual user's choice in TV programs. Figure 4, 126, lists some 
examples of such traits. Some of these traits are derived from program information, 104. Some 
of these traits are derived by looking for users liking for association of multiple traits or 
association of traits with other viewing parameters which includes but is not limited to time, day 
of the week, holiday, working day and weather season. Some of these traits are derived by data 

20 analysis of Viewer behavioral database. Any given program would exhibit varying degrees of 
the above identified traits. This is expressed as the trait-ness of the trait in the program, e.g. trait- 
ness of comedy in Seinfeld may be 1.2 and trait-ness of comedy in Mad About You may be .79. 
The trait-ness of a trait in a program is computed as a part of the data analysis, 1 1 8. 

The present invention also provides for a methodology for determining viewer 

25 preferences. In one embodiment; an individual viewer picks one show to watch out of the 

collection of available program by evaluating a stochastic liking function for each program and 
choosing the program with the highest score. The liking function is modeled as an aggregate of 
liking for a specific trait and the degree to which that trait is exhibited by that program. The 
liking function can be computed as: 

30 l(p) = ZX(tn)* tn 

Where 

l(p) represents the liking for program p 

X(tn) represents the liking for a specific trait tn 

tn represents the degree to which the Trait tn is exhibited by p 
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When a viewer watches television, it is assumed that the viewer chooses a program for 
which the value of l(p) is the highest. For any given viewer, the better the values of X(tn) 
represent its actual liking, the more the accuracy in determining its preference for TV programs. 
Thus for any given user if all the relevant traits are known along with the exact liking of each of 
those traits, the above Liking Function would always accurately predict the liking of any given 
program. 

However, identification of all the relevant traits and the determination of the exact liking 
of each viewer is a non trivial task and arrived at iteratively by a process of regression analysis. 

While a majority of the traits relevant to a specific user may be derived directly from the 
EPG information itself, some of the traits are discovered by analyzing the user's viewing habits 
over time. Traits that are suitable for discovery by a process of regression analysis are generally 
hidden or associated traits. 

Hidden traits are those which influence a user's viewing habits but which can not be 
derived from the EPG information. For example, sitcoms featuring a specific ethnic background 
would resonate more with viewers of that ethnic background. Another example of the effect of a 
hidden trait could be the strong liking of the sitcom "Frasier" amongst the set of users who have 
a strong liking for the sitcom "Cheers." Assuming that the names of actors performing in a 
sitcom are not be available in the EPG data, this affinity towards both the sitcoms by the same 
set of people may be explained by the presence of some trait commonly exhibited by both the 
sitcoms, namely the presence of one of the central characters in both the sitcoms. Such hidden 
preference criteria would have to be captured by adding the hidden traits in the computation of 
the Liking Function. 

Associated traits - Traits which have a different influence on a user's viewing habits 
when combined with other traits. For example a user would have a certain liking for any given 
Seinfeld.episode, and a certain liking for any premiere sitcom being aired for the first time. 
However, its liking for a premiere episode of Seinfeld may be sufficiently large enough to 
require an additional trait, "new Seinfeld", to fully explain its liking for a premiere episode of 
Seinfeld. 

The liking of each trait for a given user has to be similarly refined from initial 

approximations by regression analysis. Examples of traits and the associated liking for a sample 

user include but are not limited to those listed in Figure 4. The trait "NBC <=> NEWS" is an 

example of an associated trait where the traits being associated are "Channel NBC" & "Program 

Type News." Users liking for NEWS programs may be 1 & the preference for the "NBC 

Channel" may be 2 whereas its preference for NEWS on NBC may be 13, i.e., this user does not 
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always watch news prog rams or programs in general on the NBC channel, however he has a 
sirong preference for NEWS programs on the NBC channel. If only its liking for the "Channel 
NBC & "Program Type News" are considered, its preference for "News on NBC*' would be 
unexplainable. 

5 This procedure to identify traits is first carried out using the viewing habits of the 

representative sample along with the determination of the distribution of likings of each trait 
within the representative sample. 

Some of the outputs of the analysis 102, of the viewing habits of a representative sample 
are a set of preference determination parameters viz. (i) the traits which are exhibited by 
10 recurring program and the degree to which such a trait is exhibited, and (ii) a distribution list of 
the likings amongst the viewing population of each of the above trait 

One methodology to derive these parameters is outlined in Figures 5(a) and 5(b). 
Different phases of the flow chart are explained further in the following sections. An initial s^t._^ 
of liking values are computed for each user, 129. This initial liking value may be arrived at in 
15 various ways. One of the ways could be to base it on the amount of time for which a given traU 
was watched as opposed to the amount of time it was available during the user's viewing period. 

Using these initial liking values for the traits the Liking Function l(p) is computed for all 
the programs that were available during the times the user watched a given program. These 
programs are then arranged in descending order, 140, of liking value for a time period during 
20 which the viewer watched a program, 141. The actual program the user watched at a given time 
is highlighted, 143. If all the highlighted programs in Figure 6 had the highest score, the Liking 
Function correctly identified the user's preference. If the program watched by the user ranks 
below some other competing program available at that time the liking values used do not 
correctly reflect the user preference. By way of example, but not by way of limitation, if there 
25 are N programs ranked above the program actually watched, the error in the Liking Function for 
this program may be called N, 142. Such values of N are determined for each program watched 
by the user by comparing it with the Liking Function of the other competing programs. In the 
example illustrated in Figure 6 values of N are computed for the times when the user watched 
TV between the hours of 10:00-10:30, 10:30-1 1 :00, 1 1 :00- 1 1 :30. Using regression analysis, the 
30 liking values of each of the traits are adjusted incrementally to reduce the average value of N. 

The set of liking values which yield the lowest value of the average of N are considered the best 
set of liking values for that user. 

One methodology of the regression process is illustrated in Figure 7. At the beginning of 

the process liking values for traits a, b, c, are Xal, Xbl, XcL. ..(144,145). Xal represents the 
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liking of trait, "a," Xbl represents the liking of trait M b" and so on. The Liking Function of all the 
relevant programs are computed using this initial set of liking values. The liking value \a\ of the 
first trait "a" is changed incrementally to determine the average value of N. The value Xa2, 147, 
which yields the lowest average value of N, is taken as the new liking value for the trait "a." 
5 This new value liking value Aa2, 148, now replaces the old value and is now used to determine 
the remaining X values. The above set of liking values computed would predict the users 
preferences with a high degree of accuracy if set of the traits considered in the Liking Function 
includes all the traits that are relevant to a user. The average value of N not converging (to 0 or 
some other acceptable value) would indicate that not all traits that are relevant to the user have 

1 0 been considered in the computation of the Liking Function. Introduction of additional traits, 
either associated or hidden, is used to improve the determination of the user's preference. 
A determination of associated traits is achieved by a variety of different methods. 
One method is through the application of heuristic rules of thumb where the associative 
value of a number of traits is not reflected in the program information obtained from an EPQ but 

15 is relevant to human viewing habits. For example, a user who has a liking for Seinfeld would t . 
most probably have a much higher preference for a premiere or new episode of Seinfeld. Such 
heuristic rules of thumb regarding the associative value of a number of traits may be passed to 
each set top box via the Head End- Another method of determining associated traits is with an 
algorithmic search which looks for common traits in programs and introduces new associated 

20 traits to try to improve the Liking Function for a user. 

Determination of hidden traits in different programs can be achieved in a number of 
ways. One of these methods is illustrated in Figure 8. The possibility of the presence of a hidden 
traits exists whenever there exists a strong co-relation between any two traits present in different 
programs. The y-axis represents an increasing liking value for higher values of y. Every point on 

25 the X-axis in Figure 8 represents a user arranged such that the liking value of the trait H tn M 
increases with higher values of x. If for the same values on the x-axis the liking values of a 
strongly co-related trait ,, tm ,, are also plotted, it will exhibit a relation to the liking graph of the 
trait M tn n . The co-relation between any two traits may be explained by he presence of a common 
hidden trait. Thus traits "tm" and "to" may be expressed mathematically as 

30 tm = tx + tm* 

tn = Ctx + to' 

where C is some constant indicating the amount of co-relation between traits "tm" & "tn". 

Hidden traits can also identified by applying rules of thumb or some other appropriate 
manner. 
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While ail programs of a specific genre exhibit some common traits, the degree to which 
these traits are exhibited vary from program to program. This degree of traitness for recurring 
programs can be quantified such that it best explains the viewing choice of watching or not 
watching that program in the representative sample. For example consider a user who has a 
5 certain liking □ for comedy. The users' decision to watch a comedy program would be 
influenced by whether the amount of comediness exJhibited by that program crosses some 
threshold of comediness that is determined by its liking for comedy. 

One method to determine the traitness of a trait T in a given program N is illustrated in 
Figures 9(a)-(c). The highlighted program represents the programs actually watched 156 by a 

10 user. All available programs are arranged in decreasing order of rank 154 as computed by the 
Liking Function. In the case of User 1 155 the Program N 156 which was watched by him 
appears third in rank whereas the Liking Function should have actually ranked Program N at the 
top. In this case the margin of error 1 57 in the traitness of trait T is 3 . In the case of User 4 the 
program N watched by the user was ranked the highest. The margin of error in the traitness of 

1 5 trait T is 0. In the case of user 2 where he did not watch Program N, it was ranked the highest. \ 
The margin of error in this case can be considered to be a constant K. A suggested value of K 
may be the number of programs available to the user at that time. Similarly for user 3, the 
margin of error may be considered K-l. Such margins of error are computed for all users who 
watched the Program N and the average margin of error is computed. 

20 Using regression analysis, the traitness value of the trait T is adjusted incrementally to 

reduce the average value of the margin of error. The value of traitness which yields the lowest 
value of the average of the margin of errors is considered the best value of the traitness of the 
trait T exhibited by the program N. . 

, Traitness values may also be assigned by rules of thumb or some other appropriate 

25 manner. 

One of the outputs of analyzing the viewing habits of a representative sample is a 
distribution list of the likings amongst the viewing population of each trait. This information is 
presented schematically in the Figure 10 and is made available to every individual user Set Top 
Box along with the broadcast TV programs and program information. "Number of traits", 164, . 
30 represents the total number of traits that have been identified after analyzing the viewing habits 
of the representative sample. The distribution of liking of each trait is provided in a "Trait 
Record", 165. Information included in a trait record include the name of the trait, 167, the type 
of the trait, 168,( whether hidden, associate, etc), the liking distribution of the trait 170, and the 
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distribution.parameters, 172, of the trait. Examples of various possible values in a trait record 
are provided in Figure 11. 

Figure 12 illustrates one schematic representation of traits present in recurring programs. 
This information is passed along with the above liking distribution of traits. The "Number of 
5 programs' 1 173 represents the number of programs for which trait information is sent. Relevant 
- traits and the degree to which they are exhibited are included in the "Program Record" 174. The 
"Program Identification" 177 information is used to identify the TV program to which this 
record pertains. An example of a Program record for a Seinfeld Episode is given in Figures 13. 
As given in the example certain traits such as "Program Name - Seinfeld", "Channel — NBC", 
10 "Program Type - Sitcom" can be derived directly from the EPG and are called static traits. 
Additional traits such as "comedy" which store the degree of comediness associated with a 
Seinfeld Program may also be passed as a part of the EPG data. 

Referring again to Figure 2, the program content and the program information (in the - 
form of an EPG) are transmitted from a Broadcast Head End along with preference 
1 5 determination parameters, 1 19. Examples of preference determination parameters include but^re 
not limited to (i) the traits which are exhibited by each program and the degree to which such a 
trait is exhibited and (ii) a distribution list of the liking values amongst the viewing population 
of each trait. 

This information is received in each household by a program selection device 100, that 
20 include a preference Determination Module, a personal preference database, 1 16, a storage 
device and a display device 108. The personal preference database 1 16, is used to store the 
results of the analysis of the individual user's viewing habits. The storage device stores programs 
as per the user's specific requests or programs recommended by the preference determination 
f module." " 

25 Program Selection Device 100 monitors each user's viewing actions and selection of TV 

programs being watched. This is stored in a storage device or in memory in the form of selection 
history data 189 (see Figure 15). The schematic representation of selection history data is given 
in Figure 14. The "Number of selection records", 180 represents the number of user selection 
choices stored in the selection history data. Each selection record 181, contains the information 

30 on the actual programs watched 185 along with information on the competing programs 

available at that time 186. Storing program information is required as the EPG may not be able 

to provide information on past programs. Information on these program may be obtained 

directly from the EPG data when the information is obtained while the program is still current. 

The time 183 and duration 182 for which a program was watched also form a part of the 

-21 - 

8NSOOCID: <WO 011725QA1J_> 



WO 01/17250 PCT/US00/23868 



selection record. As illustrated in Figure 15, a uses selection history 189 is derived from each 
choice 187 made by the user along with program information from the EPG 104. 

Onq process of learning the user preferences of a specific individual is illustrated in 
Figure 16. The liking distribution in the representative sample for traits identified by data 
5 analysis, 1 1 8, are used by the Liking Function to minimize the effect of error introduced because 
of lack of sufficient sampling in the computation of the liking of the identified traits by an 
individual. An individual user's response to setup question 190 may also be factored in 
determining initial values of liking for that individual 1 92. 

A user selection history 1 89 is maintained for a fixed number of hours. The number of 
1 0 hours for which the user selection history is maintained can be preferably changed to have an 
increased rate of learning during initial days of a new user using the device. Average Error is 
computed N 1 94, in a similar manner as for users; in the representative sample, as previously 
described in Figure 6. If the average error is greater than a tolerable limit 195, new liking values 
are computed 198. Entries in the user selection history are moved to the Past selection historyTff 
15 the Average error remains under a tolerable limit the liking values are computed only after a 
predefined number of hours 196. 

Another embodiment of the selection record stores a program ID instead of the entire 
program information. In this embodiment, the actual program information is broadcast at 
predetermined times from the Head End where each program is identified by the same program 
20 ID stored in the selection record. In this embodiment, the computing 198 of liking values for 
traits are performed only at this predetermined time. 

Referring now to Figure 17(a), after each computation of liking values, entries in the user 

selection history is moved to the Past selection history. Selection history is partitioned as user 

selection history 189 and Past selection history 216 to optimally store the section history 

25 records. Selection history contains information about all selections made by the user between 

two computations of Liking values. Past selection history maintains the most relevant 

information about the most relevant selections made by the user in past. The number of records 

in the Past selection history can be advantageously configured to suite the memory available for 

storage of such records. Relevancy in this context is the importance of the record in computing 

30 the liking values and is dependant on many parameter including but not limited to the age of the 

record 200, the repetition rate 203 of traits contained in programs 204 in the selection record. 

The relationship between relevancy and age 205 of the record is shown in Figure 17(b). The 

most relevant records are the most recent records. The relationship between relevancy and the 

repetition rate of traits contained in programs in the section record is shown in Figure 17(c). As 
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the repetition rate decreases, the relevancy increase 206, till certain limit after which the 
relevancy decreases 207. A good example to illustrate the reasoning behind this relationship 
would be relevancy of keeping records on user selection when Seinfeld a weekly program is 
available and the relevancy of keeping records on user selection when Olympics a 4 yearly 
5 program is available. While it is important to keep records about Seinfeld to compute likings of 
various traits contained in Seinfeld, it is pointless to keep records about viewer's liking for the 
last Olympics. Consider a user who does not have a liking for movies except on Fridays. To 
accurately determine the user likings of trait of "Friday night-ness M of a movie, the selection 
records pertaining to its viewing actions on a Friday have to be maintained for at least a number 
10 of weeks. However if the repetitive nature is too far apart in time, the relevancy decreases 
sharply. The "Relevancy versus Repetitive Rate of a Trait 1 ' graph in Figure 17(b) measures 
increasing values of relevancy for higher values on the y-axis. The x-axis measure decreasing 
rates of repetitiveness of a trait in a program. As shown in Figure 17(a), beyond a certain 
threshold of repetitive rate, the relevance decreases sharply. ~~ 
1 5 The process of updating of past history is shown in Figures 1 8(a) and 1 8(b). The 

processing is done for every record in the user selection history. If there is enough memory to 
store the new record in the past selection history 2 1 8, the record is removed from the user 
selection history and added to the past selection history 222. If there is not enough space in the 
past selection history 218, the records in the past selection history are sorted based on relevancy 
20 220, and :he least relevant record is deleted 221 to make space to add the new entry 222. 

In another embodiment of the present invention, the number of available programs 186 
stored in the past selection history can be limited to an optimal number to make best use of the 
memory available (see Figure 18). In this embodiment only the programs with higher values of 
liking, representing the traits which have significant liking values, are stored in past selection 
25 history record. 

Computing of Liking values for different traits performed by the Program selection 
device 100 is shown in Figure 19. This process is performed 1 98 during the learning process as 
explained by Figure 16. Regression analysis is performed to minimize error in prediction of 
viewer behavior stored in user selection history and past selection history by using current liking 
30 values as the starting point. The regression analysis process 223 is the same as the one 

performed in representative sample 131. If there are any selection record for which the error is 
more than a predefined threshold 224, the selection record is examined for the presence of 
possible associated traits 225. The rules which defines the presence of possible associated traits 

may be rules of thumb. Possible association of traits can also be discovered by looking up the 
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list of associated traits which were discovered from the representative sample. Liking for 
possible associated traits are computed by regression analysis 226. If the computed liking is a 
significant value 23 1, the liking for trait is entered 230 into the preference database 1 16. Other 
parameters like the repetition rate for the trait also is entered into the database 230. The 
5 repetition rate is computed by looking at the repetition rate of the trait in user history and past 
history. If there are no previous selection record with this trait, then the repetition rate is 
assumed as a predefined value. As new selection records are created, the repetition rate is 
updated. At the end of the process (illustrated in Figure 19) of computing the liking values, a 
weighted average of the current liking values and the old liking values is computed. This forms 

1 0 the new set of liking values. Learning rate can be increased or decreased by changing the weight 
of the current liking values. 

A flow chart of computing scores for programs for future prediction is illustrated in 
Figure 20. As described earlier Liking 23jS for a program 232 is computed by evaluating the - 
function l(p) = Z A.(tn) * tn. X(tn) is computed as a weighted average of liking for the trait tn for 

15 the user 234 and the liking for the trait tn for the general population 235. 

In another embodiment of the present invention which is suited to function on program 
selection devices operating on households with multiple users, the viewer is asked to input the 
number of viewers in a household during the initial setup process. Each viewer can answer a set 
of setup questions which would capture their liking for representative traits. An initial set of 

20 possible liking values are created for each viewer. Learning process updates each viewer's 
preference database separately after identifying each viewer session correctly. Viewer 
identification is done either by explicit input from the viewer identifying the viewer or 
automatically by identifying the viewer by monitoring the television viewing behavior of the 
: viewer. To identify a viewer automatically, the error value between predicted and actual is 

25 computed for all viewers. The viewer with liking value which yields the least error value is 

chosen as the possible viewer. The certainty of this decision is expressed as a probability which 
is computed as a function of the differences between error values for different viewers in this 
household. 

The objective of the present invention is to determine demographic characteristics of a 
30 user by analyzing the users viewing habits in juxtaposition with viewing habits of a 
representative sample of users whose demographic characteristics are known. These 
demographic characteristics of a user collectively constitute a demographic profile. Upon 
successful creation of an accurate demographic profile for a user, the present invention can 
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receive a collection of possible ads and show individual users only those which are targeted to a 
matching profile. 

Some example of demographic characteristics include, but are notlimited to, a users 
gender, race, age and income. The output of the analysis of viewing habits of the representative 
5 sample provides a basis for determining demographic characteristics of the individual user. 

The TV program viewing preference information chosen in the analysis of viewing 
habits, of the representative sample, plays an important role in determination of a demographic 
characteristic. Different TV programs might be required to detej-mine demographic 
characteristics. Typically TV programs where a majority of the viewing audience shares a 
1 0 common demographic characteristic, in a proportion which is different from what is observed in 
the general population, are best suited in the discovery of demographic characteristics. For 
example to determine the "income' 1 demographic characteristic consider graph (i) in Figure 21a, 
The x-axis represents the annual" salary in ( increasing order of x. For any point in the x-axis, the 
y-axis represents the probability of finding an annual salary of x in the representative sample ' ' 
15 (which is the same in the general population). Now consider graph (ii), where the dotted curve* ' 
plots the same income distribution represented in the graph (i), the first solid curve (the leftmost 
curve) shows the income probability distribution of the viewing audience for : jSograrnTl in the 
representative sample and the second solid curve represents the income probability distribution 
of the viewing audience for program P2 in the representative sample. Analysis of the viewing 
20 habits of an individual user for the above programs PI & P2 could be used to determine his most 
probable annual income. Using Bayesian inference this probability may be expressed by the 
following mathematical formula 

P(Ixy| Wp l)=P(Wp 1 |Ixy)P(Ixy)/P(Wp 1 ) 
where 

25 - Ixy represents an annual income range of x-y 

• Wp 1 represents watching program p 1 

- P(Ixy|Wpt) represents probability of income of Ixy given the fact that program 
pi was watched 

- P(Wpl)Ixy) represents probability of watching program pi given the fact that - 
30 the annual income is Ixy 

- P(Ixy) represents probability of income of Ixy in the representative sample 

- P(Wpl) represents probability of watching program pi in the representative 
sample 
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Thus for a user who watches program p 1 , the probability of his income being within a 
certain range can be determined. • 
Programs whose viewing audience display a demographic characteristic in the same \ 
proportion as is observed in the representative sample (which is the same in the general 
5 population) do not contribute significantly in determining demographic characteristics. Consider 
the graphs in Figure 2 1 b. Graph (i) shows a bar graph which represents the probability of a 
person in the representative sample being male or female. The graph (ii) shows the probability of 
person being male or female within the viewing audience of program P3 which is very similar to 
what is observed in the general population. If Bayesian inference is used to determine the gender 

10 of a person who has watched program P3, the probability of the person being a male or a female 
would be the very close to the probability of person being a male or a female in the general 
population (which is already a known value). Nothing significant is gained by analyzing the 
viewing habits of the individual for the program P3. On the basis of the graph (iii) on the other 
hand, analyzing viewing habits for program P4 would yield additional certainty in determining^ 

1 5 the gender of a person than what is already known about the gender distribution in the general* * 
population. 

The detennination of wrjich~programs yield high probability values in determining a 
demographic characteristic of a user may be done by applying rules of thumb through an 
algorithm which chooses the program on the basis of how much a demographic trait in it 
20 viewing audience differs from what is observed in the general population by some other means. 

Depending on the program content and the demographic characteristics of its viewing 
audience, the same television program maybe used to determine one or more traits. It is also 
possible that a completely different set of programs are required to study different demographic 
characteristics. 

25 Furthermore, the present invention contemplates analyzing other preferences of a user in 

addition to or in place of the users television viewing preferences. Such information may 
include, but is not limited to, musical preference, reading preferences, shopping preferences (a 
very large category that comprises, among many others, preferences in clothing, furniture, 
jewelry, decorations, appliances, electronic equipment), political and religious tendencies, etc. 

30 Such information may be obtained from any source, but in one preferred embodiment it is 

obtained from on-line sources (typically accessible as world wide web sites) that the user is a 
member of or visits frequently, such as music clubs, book clubs, other special-interest sites (e.g. 
a site devoted to a particular sport or hobby), news sites, etc. Other information descriptive of 
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the users preferences includes user subscriptions, such as to newspapers, magazines newsletters 
shopping catalogs, and other special interest publications. 

While it is possible from Bayesian inference to ascribe a certain preference for a specific 
show based on a users demographic characteristics, it may not the only factor which explains 
the users choice of a particular show. For example, consider the graph (ii) in Figure 21a. It might 
be possible for a user, Ul, having income less than xl having a strong preference for program 
P2 for unknown reasons. So just using Bayesian inference to arrive at Ui's most probable annual 
income based on his viewing habits of only a single program P2 would not yield the most 
meaningful result To strengthen the belief of a Bayesian inference, the users viewing habits 
have to be analyzed for a set of programs where the viewing audience of each program from that 
set displays a similar demographic characteristics. The programs of the set are so chosen that 
degree of co-relation of traits exhibited by the programs in the set are minimal. Each program 
that the user views from this set of programs adds towards strengthening the belief of the 
Bayesian probability of the user possessing that demographic characteristic. 

To explain the need for minimal co-relation consider the previous example. If another " * 
program P3 which is very similar to P2 in content and has the same demographic traits in its 
viewing audience as P2 is available to user Ul, he would have a similar liking for P2 as P3. In 
this scenario strengthening the belief of the Bayesian probability by the user Ul watching 
program P3 is not a positive contribution. However if program P3 were very different in 
contents and the traits that it exhibits, the probability of Ul choosing P3 is greatly diminished 
and keeps the belief in the Bayesian probability from erroneously rising. In this situation the 
probability of a user who actually possessed the demographic characteristic of watching both 

-programs P2 &: P3 would be quite high and the strengthening of the belief would be a positive 

fCOToibutipii. 

Thus a Belief Function for the Bayesian probability for a demographic characteristic del 
may be computed in many ways which includes but is not limited to 
BF(dcl) - MAX( bp(wl), bp<w2) ... bp(wm) )*crl + 
MAX(bp(wk), bp(wk+l) .., bp(wk+m) )*cr2 + . . . + 
MAX (bp(wn), bp(wn+l) bp(wn+tn) )*cm 
where 

- bp(wn) is the Bayesian probability of a user having that demographic characteristic 
given the fact that he watched program wn 
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- (wl, w2, ... wm), (wk, wk+1, wk+m), (wn, wn+U .... wn+m) 

are sets of programs where any member of a set has a high degree of co-relation with another 
member of the same set, but a very low degree of co-relation with a member of another set. 

- MAX(bp(wn), bp(wn+I) ... bp(wn+m) ) represents the maximum 
5 Bayesian probability value of all possible values within that set 

- crl represents a co-relation co-efficient of the set (wl 5 w2> ... wm), c2 represents co- 
relation co-efficient of the set (wk, wk+1, wk+m) and so on. Elements of the same set have 
the co-relation co-efficient 

The Belief Function for any demographic characteristic is illustrated by the S curve in 
1 0 Figure 23 a, The x-axis represents increasing values of the Belief Function for a demographic 
characteristic for increasing values of x. The y-axis represents increasing values of the 
probability of a user displaying that demographic characteristic for increasing values of y. As 
shown in Figure, the probability does notJncreasing any further for higher value of the Belief 
Function. 

15 Some of the output of the analysis of viewing habits of the representative sample are * 

schematically illustrated in Figure 23b: 

Display of advertisements hy a program selection device in accordance with the 
invention is done when a users computed demographic profile matches the target profile of the 
advertisements. Some of the elements contained in an advertisement are the advertisement 
20 contents (may consists of video clips, audio clips, and/or graphical or textual information) and 
the meta data which includes information on the demographic profile to which this 
advertisement is targeted. 

The meta Data contained in an? advertisement is represented schematically in the Figure 
23c. Here dumber of Target Records" refers to the number of "Target Records" included in this 
25 "Target Ad Meta Data". Each "Target Record" refers to a demographic characteristic that this 
advertisement is targeted to. The Target relationship rule" is used to determine the relationship 
rule. For example, the advertisement may be targeted towards users who satisfy any one of the 
following criteria: 

income level is between Ix & Iy 
30 - ethnic background is ebl 

The "Target relationship rule" would be used to specify the above relationship. 
The "Demographic Trait Data" determined by analysis of viewing habits of a 
representative sample with known demographic characteristics and the "Target Ad Meta Data" 

are broadcast to a Program Selection Device from a Broadcast Head-end along with program 
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content and.EPG data. This is illustrated in Figure 22. This is received by each user at each 
Program Selection Device to determine the user demographic profile. Broadcast of the 
"Demographic Trait Data 11 and "Target Ad Meta Data" along with the advertisement content 
may be done on a periodic basis, or may be made always available on the broadcast networks or 
5 through some other communication mechanisms on an as-needed basis. 

At the Program Selection Device the "Demographic Trait Dam" and "Target Ad Meta 
Data" are collected and stored in a storage device such as, but not limited to, a hard disk, flash 
ROM, or main memory. Collection of this data may be done at fixed time periods or whenever 
suitable depending on the embodiment chosen. For each trait record in the "Demographic Trait 
10 Data", the Program Selection Device goes through past Selection History. Each program on the 
"Trait Value Record" that is available in the past Selection History is used in the computation of 
the Belief Function. The Belief Function Distribution graph in the "Trait Value Record" is then 

,4 

used to determine the probability of the user having that demographic characteristic. If an earlier 
probability for this demographic characteristic is available a weighted average of the old 

15 probability and the newly determined probability is taken and stored. Each targeted 

advertisement where the "Probability satisfaction criteria" of the "Target Record" is met by 
user's demographic profile is chosen for display at the appropriate time. 

As discussed elsewhere in the disclosure, in another preferred embodiment the present 
invention enables providing other targeted content to the user in addition to advertisements. 

20 Referring again to Figure 1, television control system 100 is preferably implemented by 

way of a general purpose digital computer and associated hardware that executes stored 
programs to implement the functions shown within block 100. The exact hardware and software 
"platforms on which the television control system 100 is implemented is not important and may 
take a variety of forms. For example, television control system 1 00 may be implemented in a 

25 ' set-top box such as may typically used by individuals in the home to receive CATV signals. 

Another implementation of television control system 100 is in the form of a personal computer 
configured with the requisite hardware and software to receive and display television signals. An 
example of a set-top box that maybe programmed in accordance with the principles described 
herein is described in the following documents by IBM Microelectronics: "Set-Top Box 

30 Solutions", Product # G522-0300-00 (Nov. 19, 1997); "Set-Top Box Reference Design Kit", 

GK10-3098-00 (April 15, 1998); "Set-Top Box Peripheral Chip", GK1 0-3098-00, (April 15, 

1998); "Set-Top Box Solutions: Helping Customers Meet the Challenges of Convergence", 

G522-0300-00 (Nov. 19, 1997); and "The challenges of convergence for Set-Top Box 

manufacturers", G599-0302-00 (Nov. 19, 1997). An example of an Application Programming 
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Interface (API) available for set-top boxes which can serve as a platform for the embodiments 
described herein is described in M AP1 Requirements for the Advanced Set-Top Box" published 
by OpenCable (Oct. 21, 1997). An example of an operating system incorporating functionality 
to support the embodiments described herein is available from OpenTV, Inc. and is described in 
5 the following Technical White Paper publications by OpenTV,. Inc.: "OpenTVTM Operating 
Environment" and "Application Development for OpenTVTM/ 1 An advantage of such an 
operating system is the support provided in the form of function calls to obtain attribute 
information 107 from the signals 104. Alternatively, a general purpose operating system such as 
the Windows NT operating system from Microsoft Corporation may be used in conjunction with 
1 0 additional software that provides the functions required to extract the necessary information 
from attribute information 107 and to perform other manipulation of the received signals 104 
and the stored information 105. 

Storage devices 106 may include a variety of different types of storage devices. For , ^ 
example preference database 116 may be stored in a non- volatile, random-access semiconductor 
1 5 memory. Television programs 105 and attribute information 107 may be stored on storage . , 
devices having greater capacity such as a conventional magnetic, hard disk drive. In general, 
storage devices 106 are understood to encompass a variety of storage devices. The exact form of 
the storage devices 1 06 is not critical so long as the storage devices have the capacity and speed 
to store the necessary information. Storage devices 106 may also comprise a conventional video 
20 cassette recorder (VCR) which operates under control of system 100 to store television programs 
105 and attribute information 107 on conventional magnetic tape. 

For the purposes of the present description, the television control system 100 is 
presumed to be integrated into, or coupled to, a system including a tuner and other functions 
necessary to receive television signals and to extract the attribute information 107 from the 
25 television signal and to perform other functions typically associated with the receipt and viewing 
of television si gnals . In certain embodiments, television control system 1 00 may operate in 
conjunction with a database agent that facilitates interaction with preference database 1 16 by 
causing storage and retrieval of information to or from the database in an optimal manner. The 
preference database 1 1 6 may be implemented by a commercially available database product • 
3 0 such as the Oracle Light database product available from Oracle Corporation which also 
incorporates the functionality to implement the data base agent described above. 

Recording manager 1 12 causes recording of programs 105 by periodically initiating a 
sequence of steps shown in Figure 24. At 201, recording manager 1 12 sends a request to 

preference agent 1 10 for ratings of all programs at a particular time (X), or alternatively, for 
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ratings of ail programs within a particular time period (X). By way of example, the steps shown 
in Figure 2 may be performed every six hours. In certain embodiments, the frequency with 
which the steps in Figure 24 are performed may be changeable by the user. Preference agent 1 10 
responds at step 202 by providing ratings, from preference database 1 1 6, for each program 
5 received from recording manager 112. Recording manager 1 12 then causes recordation of the 
programs at time X, or within time period X in accordance with the ratings received from 
preference agent 110. Specifically, program: 8 ; having the highest rating are given highest 
preference for recordation and programs having the lowest rating are given lowest preference to 
recordation. The recordation is subject to storage capacity constraints. For example, if the 
1 0 highest rated program is one-hour long, and only thirty minutes of recording space is available 
on storage devices 106, then the one-hour program is skipped and the highest rated thirty-minute 
program is recorded. 

Highest priority for recording of programs is given to programs specifically requested by 
the user. For example, if the user identifies a particular program for recording, such as by ' 

15 specifying the date, time and channel, or by specifying an identification code for the program,- * 
recordation of that program is given priority over programs rated by the preference agent. Next 
highest priority is given to programs matching particular category-value pairs specified by the 
user. For example, if the user does not identify a particular program, but specifies that one-hour 
long documentaries pertaining to travel should be recorded, then recordation of programs 

20 matching such category-value pairs is given priority over programs rated by the preference agent 
1 10. In alternative embodiments, relative priority between user-specified programs, user- 
specified category- value pairs and programs rated by the preference agent 1 10 is changeable by 
the user. 

Recording manager 1 12 manages storage capacity on storage devices 106 by causing 

25 deletion of television programs 105 in accordance with ratings of such programs generated by 

preference agent 1 10. This is performed in a manner similar to that explained above for 

determining which programs to record. Figure 25, which shows the steps taken by recording 

manager 1 12 to determine which programs to delete, is similar to Figure 24. At step 301, 

recording manager 301 requests ratings from preference agent 1 1 0 of all programs stored on 

30 storage devices 106. At step 302, preference agent 110 responds by providing the requested 

deletion ratings. At step 303, recording manager 1 12 responds by causing deletion, when 

needed, of programs in accordance with the deletion ratings received from the preference agent 

110. Specifically, when additional space on storage devices 106 is required to record one or 

more additional programs, recording manager 1 12 causes deletion, or overwriting of programs 
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having the lowest rating first. Thus, stored television programs which are determined by 
preference agent 1 10 to be least preferable, in relation to other stored television programs, are 
deleted or replaced first, and those determined to be most preferable are deleted or replaced last. 
Deletion of programs occurs only when required. Advantageously, this results in storage device 
5 106 typically being filled to maximum capacity, thus providing the user with as wide a variety of 
programs as possible. The user can specify programs that are to remain on the storage device 
106. Such programs are not deleted by the recording system 100 in the steps shown in Figure 25. 
In addition, the user can specify programs that are to be deleted, and therefore override the steps 
shown in Figure 25. 

10 In certain embodiments, the preference database is used by system 1 00 to alter the 

manner in which information about currently broadcast programs is presented to the user. For 
example, in such embodiments, the preference database is used to rearrange the order in which 
currently broadcast programs are presented to cause programs having attribute information 107 
rated highest by preference database 1 16 to be presented first. Alternatively, the preference 

1 5 database 116 can be used to organize information regarding the currently broadcast programs 
according to the liking of various traits stored in the preference database. 

Figures 26a and 26b illustrate alternative hardware configurations for systems employing 
the principles of the present invention. Figure 26a illustrates a hardware configuration that 
supports storage and retrieval of digitally encoded audio and video. Interface 902 is a standard 

20 digital cable or digital satellite input interface. Interface 906, which is the hardware interface to 
storage devices 106 preferably takes the form of an IDE or SCSI interface, or the proposed 
IEEE- 1394 interface. Interface 906 is anNTSC or PAL encoded video interface. If the television 
signal 104 takes the form of an analog signal, as in the case of most current television broadcast 
signals, and CATV signals, then the signal 104 must be digitized and generally compressed (for 

25 example, by the MPEG-II standard) before storage on a digital storage medium such as shown in 
Figure 26a. 

Figure 26b illustrates an embodiment using an analog storage device 106 such as a 

conventional VCR. If the television signal 104 is analog then the interface 910 takes the form of 

a conventional NTSC or PAL interface. If the television signal 104 is digital then the interface 

30 910 takes a form as interface 902 shown in Figure 26a and a digital-to-analog converter is 

required to convert the received signal to analog form before storage on storage device 106. 

Figure 27 illustrates operation of an automatic pause-record feature of preferred 

embodiments. If a user is watching a currently broadcast program and wishes to stop or 

temporarily pause viewing of the program, the recording system 100 advantageously allows the 
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program to be recorded so the user can resume viewing the program at a subsequent time. As 
shown in Figure 27, at 1002, the user is viewing a currently broadcast program. Generation of a 
pause input at 1004 by the user, such as by pressing of an appropriate button on a remote control 
coupled to the recording system 100, causes the system 100 to cause at 1006, recordation of the . 
5 program being viewed by the user. The user is then free to watch another program or stop 

watching the television 108 altogether. At a subsequent point in time, if a resume viewing input 
is received, such as by pressing of an appropriate button on the aforementioned remote control, 
then at 1010, the recording system 100 causes the program recorded at step 1006 to be retrieved 
and shown on the television 108 from the point the recordation was initiated at step 1006. If the 

10 program is still being broadcast when step 1010 is initiated, then recordation of the program 

continues by the system 100. The user thus can easily interrupt viewing of a currently broadcast 
program and resume subsequent viewing. 

Preferably the recording system 100 supports a variety of functions such as fast- forward, 
rewind and visual scan of stored programs, and other functions supported by the storage medium 

15 106. For example, if the storage medium 106 takes the form of a VCR then the program viewing 
and manipulation functions will be limited to the standard VCR functions of fast-forward, 
rewind, forward or reverse visual scan. If the storage device 106 takes the form of a digital 
storage medium then more advanced program search and retrieval functions can be supported. 
It will be appreciated that the functions performed by the preference agent 110 and the 

20 recording manager 112 are illustrative of a particular embodiment. However, the division of 

tasks between the two modules 110 and 112 may be changed. In addition, the data formats 115, 
1 16, 105 and 107 may also take a variety of forms. 

Figure 28 illustrates an EPG and text information service in accordance with one 
embodiment of the present invention. As shown, a local cable television company's billing 

25 vendor., 10 communicates via a billing link to an RS-232 port of a system manager 12 located at 
a cable head end. Billing vendor 10 includes a subscriber database and generates a monthly bill 
for the subscribers in the system based on the level of service and any pay-per-view purchases. 
System manager 12 can be a personal computer or other processing device which receives 
viewer authorization transactions from billing vendor 1 0 and generates transactions for delivery 

30 to the distribution apparatus or the subscribers. Such transactions include text channel definition 
transactions which instruct the subscriber's set top box which group of channels it is entitled to 
receive, which frequency to tune for a particular text data channel, whether to mute the audio for 
that text channel, the pagination delay between pages, and the like. 
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System manager 12 also communicates via a head end link to an RS-232 port of a head 
end controller (HEC) 14 which controls the transmission of television programming to the 
subscribers. As will be described in more detail with respect to Figure 29, HEC 14 
communicates via a control link to an RS-232 port of an information services processor (or data 
5 controller) 16 which manages the flow of EPG and text data in accordance with the invention. 
As shown by dotted line in Figure 28, information services processor (ISP) 16 is preferably 
located at the cable head end with system manager 12, HEC 14 and the signal scramblers. 
However, those skilled in the art will appreciate that all of the head end equipment need not be 
located at one site. 

10 EPG data is supplied from one or more local or remote EPG suppliers 18 via a satellite 

link, modem link or other communication link to an RS-232 port of ISP 16. Similarly, text data 
from one or more text channel suppliers 20 is provided via a satellite link, modem link, or other 
communication link to another RS-232 port of ISP 16. In preferred embodiments, ISP 16 has a 
plurality of identical RS-232 ports for accepting data from a plurality of EPG suppliers 18 and 

1 5 text channel suppliers 20. 

Also, as shown, one of these RS-232 ports can be used for a control link to HEC 14 as 
well. ISP 16 manages EPG and text source databases in response to control signals from HEC 14 
in order to provide EPG data and/or text channel data to selected viewers. 

HEC 14 also provides control data directly to the viewer's set top box via an RS-485 

20 output port. Preferably, the control data from HEC 14 can include text channel definition 
transactions as well as EPG definition transactions for instructing the set top box at which 
frequency to tune for the EPG data and the like. The control data may also include software for 
downloading into the viewer's set top box for reprogramming the viewer's set top box as 
necessary. The control data from HEC 14 can be inserted into the vertical blanking interval of 

25 the selected cable television signal by daisy-chained scramblers 22, 24 and 26 using known in- 
band techniques, although the control data from HEC 14 may also be modulated on an out-of- 
band carrier or an in-band audio carrier for transmission as. Preferably, scramblers 22-26 are 
daisy-chained so that the scramblers may be addressed individually or globally. 

Similarly, EPG data and text channel data from ISP 16 are provided to the viewer's 

30 television set top box via an RS-485 output port of ISP 1 6. EPG data and text channel data are 

similarly inserted into the vertical blanking intervals of selected cable television signals by EPG 

scrambler 28 and text channel scramblers 30 and 32, respectively. Scramblers 22-32 may insert 

the control data, EPG data, and text channel data into other portions of the video signals such as 

the horizontal blanking intervals or else replace the video entirely. Typically, the number of 
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scramblers depends on the number of premium channels for which scrambling is used. 
Preferably, EPG scrambler 28 and text channel scramblers 30 and 32 are identical to control data 
scramblers 22-26 and are similarly daisy-chained for individual or global addressing. As shown 
in Figure 28, scramblers 28-32 receive a single serial data channel which carries the combined 
5 EPG data and text data and display control transactions, as described in more detail with respect 
to Figure 29) for all data streams in use. Each scrambler is also equipped with memory for 
storing a predetermined amount of this data in an internal memory so as to minimize the number 
of database accesses. Preferably, scramblers 28-32 have internal memory sufficient to store a 
significant number of transactions. For example, scrambler 30 may have enough internal 

10 memory to score a day's sports scores for display on a sports text channel. The data received and 
stored in scramblers 28-32 is preferably in RS-485 format, and the protocol in a preferred 
embodiment is SDLC. All data transactions to scramblers 28-32 are sent on individual data 
streams specifying the target scrambler (station addresses in SDLC protocol), and the control 
data is sent on a global data stream which is filtered in the scramblers 28-32 based on the 

15 address of the scrambler so that the data streams can be configured by a transaction from ISP 16. 
The individual EPG data and text data streams are preferably generic in the scramblers so that 
they can be allocated as desired. Preferably, scramblers 28-32 have baud rates of at least 9600. 

Preferably, the subscriber's set top box is aset top box 34 which comprises : anTSPG 
memory 36 for storing the EPG data from ISP 16. For example, EPG memory 36 may store one 

20 or two weeks of EPG data for selective access by the viewer via a menu of the set top box 34. 
This menu preferably allows the viewer to scroll through the EPG data stored in EPG memory 
36 using the key pads of the viewer's television remote control device. Set top box 34 may also 
comprise a nonvolatile template memory 38 for storing the template in which the EPG data is to 
be inserted for display to the viewer on the viewer's television 40. In this manner, a video signal 

25 containing the template display data need not be continuously retransmitted to the set top box 
34, thereby saving more bandwidth. Instead, the EPG data only needs to be updated every 30 
minutes or when there is a program change. Of course, different set top boxes 34 may have 
varied amounts of memory and processing capabilities for such purposes in accordance with the 
acceptable memory costs during manufacture of the set top box 34. 

30 As shown in Figure 28, set top box 34 may also comprise a text data memory 42 for 

storing a page of text data for presentation to the screen. Thus, while one page of text data is 
displayed to the subscriber, the next page of text data may be loaded into the text data memory 
42. 
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ISP 16 manages the flow of text data and EPG data from the data service provider to the 
viewer's set top box 34. ISP 16 manages this data by accepting data only from one or more 
authorized text data and/or EPG data sources, processing the text data and EPG data in its 
internal database manager, and formatting the processed data into a common data transaction 
5 format for output to the scramblers for transmission to the set top box 34. Provision of EPG data 
and text data to the subscribers is controlled by the head end controller 14 via the control link. 

One example of a suitable ISP 16 is an IBM PS model 7546 personal computer with a 
plurality of RS-232 serial input ports for EPG data and/or text data inputs and at least one 
RS-485 HDLC serial link at its output of the type used by HEC 14. As shown in Figure 28, the 

10 control link can be a single RS-232 serial port. The hardware and software components of ISP 
16 are then configured as illustrated in Figure 29. 

One embodiment of ISP 16 is illustrated in Figure 29 as a plurality of RS^232 ports 
which provide a common interface for the EPG data and text channel data asynchronously 
provided by the EPG suppliers) 1 8 and text channel suppliers 20. The EPG data and text 

15 channel data is transmitted to ISP 16 via a satellite link (when the interface is operated in 

simplex mode) or by modem (when the interface is operated in half duplex mode). Preferably, 
the data is transmitted at a baud rate of at least 1200. 

ISP 16 functions as a "gate keeper" which only allows access by authorized data sources. 
Accordingly, when ISP 16 receives a message from an EPG supplier 1 8 or a text channel 

20 supplier _0, it first checks the data for authorization. If that supplier is not authorized, the data is 
ignored. When the supplier is authorized to access ISP 1 6, ISP 1 6 performs the requested action 
and returns a command response message. If the communications link is simplex, the response is 
ignored. In this manner, access to ISP 16 can be limited by authorization codes, but as will be 
described below, access is also limited by whether the data provider provides EPG data or text 

25 data in the transmission protocol expected by ISP 16. 

Messages sent between an EPG supplier 18 or a text channel supplier 20 and ISP 16 can 
be formatted to include a start of text byte, a data block of ASCII characters, checksum bytes 
and an ASCII carriage return. This format is used in commands sent to ISP 16 from the data 
suppliers as well as in responses sent to the data suppliers. The checksum can be a two byte 

30 CRC of all bytes in the message field beginning with the first character following the start of 
text character up to but not including the checksum field. The checksum is transmitted in the 
message as the hexadecimal ASCII representation (four bytes) of the CRC computation. The 
data blocks can be configured differently depending upon whether the input data is EPG data or 
text data. 
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EPG data from the EPG supplier 18 can be formatted in accordance with an EPG 
command set including, for example, a Define Program Command which is used to identify all 
data relating to a single program, a Define Category Command which is used to establish a 
category for identifying different types of programs, and a Delete Category Command which is 
5 used to delete an unused category to make room in the database of ISP 1 6 for new programming 
categories. The EPG data may be formatted on a per program basis by these commands. 

Delimiter characters can be used for variable length fields such as the title and program 
description blocks to identify the length of the field. For example, a NUL (0 hexadecimal) 
means the field is null, SOH (I hexadecimal) means the field is valid, and ETX (31 hexadecimal) 

10 means the end of the current record. 

Once data transmitted with a Define Program Command is stored in an EPG database of 
ISP 16, the EPG data is formatted into transactions for transmission to the set top box 34. This 
command may also be used to update a program definition since it will overwrite a 
corresponding entry in the EPG database of ISP 16. 

15 ISP 16 may respond to such commands from the EPG supplier 18 by sending an 

appropriate response such as: no error (normal response), service provider not found (not 
authorized), type of service not found (not authorized), category ID riot found, unrecognized 
command, checksum error, insufficient disk space, and the like. Other EPG commands and 
command responses may be provided as desired. 

20 The text channel data can originate from many different text channel suppliers 20 and 

arrive at the ISP 16 through different communications links such as satellite, dial up modem, 
direct connect modem or with a direct connect to the system manager 12. 

ISP 16 can include a plurality of databases and database managers. As shown in Figure 
29, there may be two, types of databases maintained in ISP 16 — one type for EPG data and one 

25 for text channel data. The EPG database is designed to collect data from each EPG supplier and 
to sort each EPG program record by channel and time of day. A separate database is created for 
each text channel for collecting text data from the associated text channel supplier 20 and 
formatting the received text data for transmission on individual text channels using the 
techniques to be described below. Each database that is created is identified by the service 

30 provider and type of service codes listed in the Define Program Command for use in the control 

link commands provided to ISP 16 from HEC 14. 

As shown in Figure 29, a received command is checked for its command code, the 

service provider, type of service and authorization code, as appropriate, by router and formatter 

43. If the command is from an unauthorized data source, the subsequent data is ignored. 
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However, if the received data is from an authorized supplier, router and formatter 43 routes the 
data to the appropriate database within ISP 16. For example, if EPG data is received, it is routed 
via EPG database manager 44 to EPG database 46. 

In one embodiment, EPG database manager 44 sorts the received EPG data by channel 

5 and time of day and stores the received EPG data in the appropriate location in EPG database 46 
for later recall. EPG database manager 44 may also perform garbage collection on the EPG 
database 46 as records are deleted. EPG database manager 44 may also call a data compression 
software routine such as the Huffman Compression Algorithm which, as known to those skilled 
in the art, maps more frequently used characters to fewer bits than the usual eight bits used in 

10 normal ASCII, while giving the less frequently used characters more bits. The number of bits 
used for a character is based on its probability of appearing in the data stream. Huffman 
encoding is described in detail in an article entitled "Lossless Data Compression", Byte, March, 
1991, pp. 309-314, incorporated herein by reference. Such a routine is desired to maximize 
storage efficiency at EPG database 46. Similarly, each text database manager can store the text 

15 information in the associated text database and performs data compression. 

» 

Router and formatter 43 and database managers 44, 48 and 52 are all controlled by 
configurator 56, which is, in turn, responsive to control data from HEC 14. Configurator 56 
responds to control commands from HEC 14 to provide updated authorization information to 
router and formatter 43 for comparison with the incoming data and for adding/ subtracting 
20 database managers and databases and the like as EPG suppliers 1 8 and text channel suppliers 20 
are added and subtracted from the system. 

Access to ISP 16 is can be carefully controlled through the use of authorization codes. In 
addition, ISP 16 can maintain control over the information services provided to the viewer by 
storing the EPG data and text data in a particular format in the appropriate database within ISP 
'. 25 . 16. 

Referring now to Figure 30, the EPG database key can be a combination of the date and 
time field and the channel number from the EPG data. Following these fields are the duration 
field, the repeat field, the program rating field, the program category field, the critique field, the 
attributes flag field, the program traits flag field, the text data compressed flag and lastly the text 
30 data. The text data field may further consist of several optional subfields with a delimiter 
between each field. 

EPG database manager 44 can access the EPG database 46 through shared library 

routines such as add a record, delete a record, read a record, and the like. As the EPG database 

46 is used, it can be fragmented as records are added and deleted, and as a result, EPG database 
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manager 44 may further include garbage collection routines for periodically performing the 
garbage collection function on the EPG database 46. The text databases are similarly configured 
except that garbage collection is not necessary. 

In one embodiment, EPG transaction formatter 58 reads the database records of EPG 
5 database 46 and formats them to program-based transactions having a predetermined number of 
bytes which are transmitted to the EPG scrambler 28 for insertion into the vertical blanking 
interval of a video signal and transmission to the set top box 34. These transactions are then sent 
via a transaction arbitrator 64 to the EPG scrambler 28 shown in Figure 28 for insertion into the 
appropriate video channel. 

1 0 The transactions from transaction arbitrator 64 can be output to a single RS-485 output 

port of ISP 16 which is connected to multiple scramblers of the type used to scramble premium 
cable channels. The transactions are segmented into EPG data and text data streams for 
transmission to the EPG scrambler 28 (if the transaction includes EPG data) or to the text 
channel scramblers 30 and 32 (if the transaction includes text data). 

15 In one embodiment, the EPG transactions generated by EPG a transaction formatter 58 

are formatted into SDLC frames as noted above. A sample SDLC format for the EPG 
transaction data is shown in Figure 3 1 . In Figure 31, the beginning flag delineates the beginning 
of the SDLC frame, the station address delineates the scrambler to be addressed, the control byte 
is a command code that defines what is to be processed, the information field contains the EPG 

20 data formatted as in Figure 30, the frame check contains the CRC for all data between the 
beginning and ending flags, and the ending flag delineates the end of the SDLC frame. A 
transmission from EPG transaction formatter 58 will address a specific data stream and a 
response from the EPG scrambler 28 will identify its data stream in the station address location. 
As noted above, such transmissions may or may not require a response from the EPG scrambler 

25.. 28., v , 

The EPG transactions typically include an Add EPG Block command including a byte 
specifying that the following data is from the EPG data stream, a control code byte specifying, 
for example, whether a reply from the scrambler is expected, two bytes setting forth the EPG 
data block number, a flag setting forth whether the EPG data is Short Term or Long Term data, 
30 the number of transactions which make up the EPG data block, and the actual transactions. EPG 
transaction formatter 58 may also generate a Delete EPG Block command which specifies that 
the data is to be deleted from the EPG data stream, the control code byte, and the EPG block 
number to be deleted. 
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Figure 32 illustrates a flow chart for one embodiment of software that can be used and 

embodied in EPG transaction formatter 58. As shown, the software starts at step 500 and gets 

the system time and date from the ISP system clock 63 at step 502. An expired EPG data block 

is then deleted from the memory of the EPG scrambler 28 at step 504. An expired EPG data 

5 block is defined as a data block representing a program which has been completely aired prior to 

the current system time or a program which was aired before the time window used for the EPG. 

At step 506, current EPG data blocks having a time and date within the EPG time window are 

read from the EPG database 46. The current EPG data blocks are then formatted into Add EPG 

Block commands and associated transactions at step 508. A block/time map of EPG transaction 

10 formatter 58 is then updated at step 510. The block/time map preferably stores the time that each 

EPG data block was sent to EPG scrambler 28. The transactions representing the EPG data are 

then transmitted the EPG scrambler 28 at step 512. EPG transaction formatter 58 then waits at 

step 5 14 for the next EPG update (which should occur when the system time enters a new half 

hour) or the next EPG change (which may occur at any time). Upon receipt of such an update or 

15 change, control returns to step 502. Text transaction formatters 60 and 62 similarly generate text 

transactions for the text data, which as noted above, is defined on a per screen (rather than per 

program) basis. Hence, an Add Text Screen command is similar to an Add EPG Block command 

except that the text channel number and screen number are provided in place of the EPG block 

number and Short Term/Long Term data bytes. The text transaction formatters 60 and 62 may 

20 also request the time from the scrambler so that proper pagination may be maintained. 

Figure 33 illustrates a flow chart for one embodiment of software that can be used and 

embodied in text channel transaction formatters 60, 62. As shown, the software starts at step 600 

and reads a text screen record from the text database 50 or 54 at step 602. At step 604, the text 

screen is formatted into Add Text Screen transactions for transmission to the text channel 

25 scramblers 30, 32 at step 606. Preferably, such transactions are formatted such that the display 

characters are sent as display commands rather than as separate characters for every display 

coordinate of the text display screen. Then, at step 608, text transaction formatter 60, 62 waits a 

period of time specified by system manager 12 (if auto-pagination is used) before the next text 

page is formatted and transmitted to the text channel scramblers 30, 32. At the end of this period 

30 of time, control returns to step 604 and the next text screen record in the text database 50, 54 is 

formatted for transmission to text scramblers 30, 32 for insertion in the vertical blanking interval 

of a particular video signal. 

Text data can be passed to the screen by sending a separate character for each display 

location of a page. In other words, if a text screen comprises 16 lines and 4 characters per line, a 
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text screen is represented by sending 384 (24X16) characters, one for each display location for 
that display screen. A blank space character is sent to indicate that no character is present in a 
particular text screen location. 

In one embodiment, the text data is transmitted to the screen along with appropriate 
5 commands for controlling the display of the text data. For example, a first display command in a 
sequence identifies the following data as text data and instructs the set top box 34 to fill the 
television screen with a blue background or some other background or template over which the 
text will be displayed. The text data is then converted into a series of commands which together 
identify the separate screens of text. As noted above, the text data is grouped on a per screen 

10 basis, which allows the appropriate delay mechanism to be incorporated into the display 

commands to provide the necessary delay between the presentation of respective text screens. 

For this purpose, transaction formatters; 60 and 62 can include software for scanning the 
text data for actual characters, skipping extra spaces in the text data, and grouping the actual text 
for transmission in transactions of a designated size which will fit in the vertical blanking 

15 interval of a field of a video signal. Since spaces are eliminated, the display commands include a 
coordinate specifying the row and column address of the first display character on the screen and 
a number of contiguous characters follow that character in the same transaction until the 
transaction is filled or a number of successive spaces are encountered. 

Attribute information such as underline, blinking, or luminance inversion associated with 

20 the characters may also be transmitted using these display commands. These display commands 
are used to read the text data for a text screen from the appropriate database, and at the end of 
the text data for a text screen, a display command is transmitted to indicate that all data for that 
text screen has been transmitted. The transaction formatter 60, 62 also includes a wait loop or 
"timeout" command at the end of the transmission which builds in a delay (on the order of 7 

25 seconds) which gives the viewer sufficient time to view a text screen before the text data for the 
next text screen is displayed, thereby providing auto-pagination of the text screen. 

Auto-pagination permits the viewer to automatically advance from one text screen to the 
next without any intervention by the viewer. In accordance with the auto-pagination scheme of 
the invention, the cable operator can specify the time duration between screens and forward this 

30 information to the transaction formatters; 60, 62. Then, during operation, when the viewer 

selects a text channel, the current page of text data is displayed by extracting the selected text 

channel data from the vertical blanking interval of the video signal in which it is inserted and 

mapping that text data to a channel of the viewer's television which is designated for display of 

that text channel. The next screen of text data will be displayed after a predetermined delay 
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which gives the viewer sufficient time to read the displayed text data for the current screen 
(approximately 7 seconds). This technique could replace the commonly used "barker" channel 
which uses a computer to generate text data which is then transmitted as a complete video 
channel over thexable television system. 
5 Configurator 56 can respond to control commands from HEC 14 to provide updated 

authorization information to router and formatter 43 for comparison with the incoming data and 
to add/subtract database managers and databases and the like as EPG suppliers 1 8 and text 
channel suppliers 20 are added and subtracted 57 from the system. The control link between 
HEC 14 and configurator 56 can be used to report the status of the ISP 16 to system manager 12. 

10 Additionally, the control link may accept text data from system manager 12 for displaying 
system messages and the like. 

In one embodiment, the interface between configurator 56 and HEC 14 is an RS-232 port 
with a data format fixed at, for example, 9600 baud. All control data is preferably transmitted as 
ASCII characters. Upon receipt of a message from HEC 14, configurator 56 checks the data, 

1 5 performs the requested action, and returns a command response message in a message format of 
the type described above for communications between router and formatter 43 and the EPG and 
text channel suppliers. Sample commands sent from HEC 14 to configurator 56 over the control 
link include a Set Date and Time command (for synchronization purposes), Request 
Configuration commands, Request Status commands, Get Category Record commands, 

20 Scrambler Control commands, and Database Control commands. 

In one embodiment, during operation, ISP 16 monitors all input ports data from the EPG 
and text data service providers and builds a list of all available EPG and text data services. This 
list is sent to the system manager 12 using a Request Configuration command. This command 
specifies the available service providers, the type of service (EPG or text data) from each 

25 provider, the commuhications port associated with each service, the scrambler address or data 
stream (EPG or text data) for each service, the authorization code from the supplier for each 
service, the time and date of the last update from the service provider, the time and date of the 
last update to the scramblers, the time and date of the latest EPG data in the EPG database, and 
the like. Such information is provided to the system manager 12 for each service provider when 

30 this command is given. 

The Request Status command can contain flags indicating whether there are errors 

present in the error log and if the category list has changed since the last Request Status 

command was received. Get Error Record and Get Category Record commands may then be 

used to extract the error and category information. 
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In one embodiment, configuration commands are separated into EPG and text service 
configuration commands. A Configure EPG Service command specifies the service provider, the 
type of service, whether the service is to be enabled or disabled, the authorization code from the 
EPG supplier 1 8, the scrambler data stream for Short Term data, the scrambler data stream for . 
5 Long Term data, the length of the Short Term data in hours (1-255), and the length of the Long 
Term data in hours (1-4096). The Configure Text Service command specifies the service 
provider, the type of service (weather, sports, and the like), whether this service is to be enabled 
or disabled, the authorization code from the text channel supplier 20, the scrambler address or 
data stream for the text data, the channel number, and the pagination delay time in seconds) 
10 before the next page of text data is to replace the current page of text data on the screen for auto- 
pagination. 

The scrambler control commands can include, for example, a Rebuild Scrambler 
Memory command which is used when a scrambler replaced and needs data to be reloaded in its 
memory and a Scrambler Configuration command for specifying the amount of scrambler 

15 memory in kilobytes. 

In one embodiment, the database control commands include, for example, a Clear 
Database command which is used to clear the database associated with a particular service and a 
Delete Database command which is used to delete the database associated with a particular 
service. Other database control commands such as a Download Category Map command may 

20 also be provided for establishing a list of the specified categories of program data in the EPG 
data. 

Figure 34 illustrates one embodiment of a set top box 34 that can be used with the 
present invention. As shown, set top box 34 includes EPG memory 36, template memory 38, 
text page memory 42, a set top box 700, and a set top processor 702 which reads commands 

25 from the vertical blanking interval of the incoming video signal and performs the appropriate 
action. For example, if the incoming command is a text channel definition or EPG definition 
command from HEC 14, the appropriate update of bit map 704 is performed. Similarly, if the 
incoming command is a display command including EPG data, that data is stored in EPG 
memory 36 and is displayed with the template stored in template memory 38 when the user 

30 makes a menu selection via television remote control unit 706 and remote receiver 708 

requesting display of the EPG data. The template data may be sent as part of EPG display 

commands if no template memory is provided. On the other hand, if the incoming command is a 

display command including text data, a page of that data is stored in text page memory 42 for 

presentation to the display a page at a time. The text page memory can be automatically updated 
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every few seconds by virtue of the delay built into the display commands from the text 
formatters 60, 62 if auto-pagination is enabled. Alternatively, the user can be allowed to 
manually access the text data in the memory. If manual access is provided, it is preferred that the 
text page memory hold at least the currently displayed text page, the previous text page and the 
5 subsequent text page in order to give the user the ability to scroll through the text data In either 
case, set top processor 702 preferably has the ability to request the next text page while the 
current page is being displayed so that the next text page is already loaded for display at the end 
of the screen delay time. The selected text, EPG or video signal is then modulated at modulator 
710 for display on television screen 40 at the channel specified in bit map 704. 

10 Bit map 704 of set top processor 702 of the set top box 34 maps the received text 

information to the designated cable channel for display by designating the frequency that must 
be tuned by box 700 to receive the desired text data. This information is received in the 
aforementioned text channel definition transactions from HEC 14. For example, the viewer may 
specify via television remote 706 that she wishes to view a sports text data channel which her 

1 5 program guide indicates to be available by tuning the set top box 34 to channel 181. Set top 
processor 702 then checks bit map 704 for channel 181 to determine that it must tune the 
frequency for channel 29 in order to extract the sports text data for the viewer's channel 1 8 1 
from the vertical blanking interval of channel 29, set top prbcessof 702 ten sets set top bo>TT00 
to tune channel 29 but the video signal for channel 29 is not displayed. Instead, the video screen 

20 is blanked by set top processor 702 and the text data extracted from the vertical blanking interval 
by set top processor 702 is displayed. Any necessary descrambling of the received video is 
performed by set top processor 702. The viewer thus perceives that many more "virtual" 
channels are available even though a separate video channel was not used to transmit the text 
data. 

25 ' In brie embodiment of the present invention, illustrated in Figure 42, electronic content is 

tagged and aggregated at a server and becomes electronic content with targeted information. 
Suitable electronic content includes but is not limited to, advertising, news segments, video 
programs, audio programs, WEB pages, and the like. The electronic content with targeted 
information is send to a broadcast server and broadcast widely. Set top box 34 has created 

30 profiles representing different people in the household. Set top box 34 matches the tags in the 
broadcast stream with the profiles created and determines which of the tagged content should be 
downloaded. A subset of all of the content that is broadcast that are relevant to the personalities 
in the household are downloaded and displayed to the user. 
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For example, set top box 34 can be used to match the tags in the broadcast stream with a 
demographic profile of a viewer previously created and determine which of the tagged content 
should be downloaded. An individual viewer receives from set top box 34 only.* Set top box 34 
matches the tags in the broadcast stream with the demographic profile of the viewer created and 
5 determines which of the tagged content should be downloaded. 

Some example of demographic characteristics include, but are not limited to, a user's 
gender, race, age and income. The output of the analysis of viewing habits of the representative 
sample provides a basis for determining demographic characteristics of the individual user. 
As shown, set top box 34 reads commands of the incoming tagged and aggregated 

10 electronic content and performs the appropriate action. For example, if the incoming command 
is a text channel definition or EPG definition command from HEC 14, the appropriate update of 
bit map 704 is performed. Similarly, if the incoming command is a display command including 
EPG data, that data is stored in EPG memory 36 and is displayed with the template stored in 
template memory 38 when the user makes a menu selection via television remote control unit 

15 706 and remote receiver 708 requesting display of the EPG data. The template data may be sent 
as part of EPG display commands if no template memory is provided. If the incoming command 
is a display command including text data, a page of that data is stored in text page memory 42 
for presentation to the display a page at a time. The text page memory can be automatically 
updated every few seconds by virtue of the delay built into the display commands from the text 

20 formatters 60, 62 if auto-pagination is enabled. Alternatively, the user can be allowed to 

manually access the text data in the memory. If manual access is provided, it is preferred that the 
text page memory hold at least the currently displayed text page, the previous text page and the 
subsequent text page in order to give the user the ability to scroll through the text data. In either 
case, set top processor 702 preferably has the ability to request the next text page while the 

25 ciment page is being displayed so that the next text page is already loaded for display at the end 
of the screen delay time. The tagged and aggregated electronic content is then modulated at 
modulator 710 for display on television screen 40 at the channel specified in bit map 704. 

Bit map 704 of set top box 34 maps the received tagged and aggregated electronic 
content to a designated cable channel for display. Set top processor 702 then checks bit map 704 

30 for channel 1 8 1 to determine that it must tune the frequency for channel 29 in order to extract 
the. Any necessary descrambling of the received tagged and aggregated electronic content is 
performed by set top processor 702. 
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The present invention also provides a mechanism to automatically create multiple 
profiles corresponding to multiple users with or without any information being explicitly 
providing by the users about themselves. Fig 35 shows an embodiment of the invention that 
causes the automatic creation of multiple profiles and automatic identification of profiles when 
5 they are active on the device. 

In one embodiment, number of profiles to be created is initially determined. Several 
methods may be used to determine the number of profiles. In one embodiment of the current 
invention the number of profiles is optimally determined by applying the Minimum Message 
Length (MML) criterion. The process of applying MML criterion for determining the optimal 

10 number of clusters in described in reference (ref..). In one embodiment of the current invention, 
a user of the device explicitly specifies; the number of profiles to be created. 

The system monitors user interactions with the device. The user interactions may include 
but are not limited to channel change requests, requests to view more information, configuration 
of device parameters, requests to performs recording or deletion of programs from a storage 

15 device. All actions due to user interactions are recorded in a history database that stores history 
of viewer actions. Data records in the history database describing viewer action can be of 
different formats. Figure 37a shows one of the possible formats for the data record. Data records 
may contain information about the action, time at which the action occurred, and some 
parameters that further describes the action. In one embodiment of the invention, only actions 

20 matching certain criteria are recorded in the history database. For example, all user action about 
watching channels can be ignored if the duration of watching is less than a configurable 
threshold, or all user action about watching a particular channel like preview channel can be 
ignored. In the default usage of the device, the user will not be identifying himself or herself 
during each usage session. A usage session in this context can be the period during which there 

25 are user activities between two periods of inactivity, or the period during which the device is 
used between two periods during which the device is not in use. It will be left to the device to 
determine who the actual user is, in order to provide a very personalized environment to the 
user. In such a scenario the user action records will not contain any information about who the 
user is. In one embodiment of the current invention, the user can identify himself or herself 

30 explicitly for all or some of the usage sessions by using some mechanism provided by the 

system. These mechanisms may include pressing a sequence of keys, or choosing a user name 

from a menu and logging-in as that user. In this embodiment of the invention the user can 

identify himself or herself during certain sessions and may choose not to identify himself or 

herself during some other sessions. In the case where the user identifies himself or herself 
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certain user action records can have the user name or some other identification data as a 
parameter which specifies that a user was togged-in for the session from which the user action 
record was generated. 

By analyzing the user action history database that was generated for a period of time, the 
5 current invention provides a mechanism to create multiple profiles automatically. Each of these 
profiles may correspond to the entire preference of a single user, a group of related preferences 
of a single user, a group of related preferences of a group of users or the entire set preferences of 
a group of users. The mechanism for creating multiple profiles is described in figure 36. 

Sets of consecutive user action records are grouped together to form usage pattern 

1 0 records. Usage pattern records can be in the form of arrays of user action records. Only user 

actions that occur contiguously are grouped together in a single usage pattern record. The usage 
pattern records can be formed using many methods, some of which are: 

1 . Grouping together all user action records that are in a single usage session into a 
single usage pattern records. This is represented graphically in figure 38a. 

15 2. Grouping together all user action records that are in a single usage session into a" 

one or more usage pattern records where each usage pattern record has a predetermined number 
of user action records. This is represented graphically in figure 38b. 

3. Grouping together a predetermined number of consecutive user action records in 
a usage session into one or more usage pattern records where each usage pattern record has a 

20 number of user action records which overlap with some of the user action records in an adjacent 
usage pattern record. This is represented graphically in figure 39. 

Each usage pattern record is mapped to a point in an N-dimensional space, each axis of 
this N-dimensional space representing a parameter that is of significance in identifying multiple 
profiles. The N-dimensions for this space are called cluster-dimensions. The parameters for this 

25 N-dimensional space are chosen either manually by people skilled at identifying the most 
significant parameters in identifying multiple profiles, or automatically by identifying the 
aspects of profiles that differs significantly between profiles. In one embodiment of the current 
invention a set of these parameters are identified and configured as the apparatus is configured 
during initialization, and a new set is added periodically by looking at aspects which differ in the 

30 multiple profiles that are stored in a device. A wide variety of combinations of initial parameters 

may be used. In one embodiment these initial parameters are the channel names in a channel line 

up and viewing times. In the embodiment, these initial parameters are possible values of 

program description fields of television programs. In one embodiment the rate of channel change 

is also one of the parameters. The mapping of the usage pattern record to a point in the space 
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defined by cluster-dimensions can be done by many methods. One method determines the 
quantity of a particular characteristic which is defined by a clustering parameter, exhibited in a 
usage pattern record, e.g. number of channel changes in a usage pattern, and uses this as the 
value of the axis- for the corresponding dimension. One method determines the.rate of 
5 consumption of a particular characteristics in a usage pattern record, e.g. rate of consumption for 
"NBC" in a usage pattern record is .5 which indicates that programs on "NBC" were watched 
50% of times during the period of this usage pattern record. 

All the points in the space defined by cluster dimension are clustered into a number of 
clusters using standard clustering algorithms. Any clustering algorithm can be used to perform 

10 the clustering. In one embodiment EM clustering is applied to group the points in the cluster- 
dimension space into a predetermined number of clusters. The number of profiles being created 
decides the number of clusters formed. Anyone skilled in the art of artificial intelligence and 
clustering, especially EM clustering, will be familiar with these clustering techniques. Each 
cluster formed as a resulting of the clustering represents a single profile. The clustering process 

15 also provides the mixture weights for each of the clusters as one of the outputs. The mixture 

weight for a cluster can be used to compute the percentage of time the profile was active, of the 
total amount of time for which the device was used. 

In one embodiment of the present invention, the clustering is performed periodically 
using the user actions records accumulated in the history database. In one embodiment, user 

20 action records are periodically removed from the history database based on certain criteria such 
as the age of the record, size of the record, relevancy of the record etc. 

The current invention also provides a mechanism to predict profile that is active at any 
given time. Figure 40 illustrates the method for identifying the profile currently active on the 
device. Current user actions are monitored and user action records are created. The most recent 

25 user action records are used to create a usage pattern record. The usage pattern record is mapped 
to a point in the cluster dimension. Using the Information about clusters representing multiple 
profiles, the, probability of each of the clusters being active given the usage pattern is computed. 
This is computed by using Bayesian Probability theory that can be used to compute the posterior 
probability using the prior probability and mixture weights. The probability of a cluster being 

30 active is computed using the probability of the usage-pattern record given the profile 

(P/usage_pattern profile), the probability of the profile being active and the probability of the 

usage record occurring. The mathematical representation for the Bayesian Probability theory in 

the current context is given below 

P(profile/usage_pattern) = P(usage_pattern, profile) * P(profile)) P(usage_pattern) 
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In this equation probability of usage pattern record occurring given profile (P(usage- 
pattern, profile)) can be computed by knowing the probability distributions governing the 
clusters. In one embodiment, the cluster probability distribution can be assumed to be the 
Gaussian probability distribution, so that the P(usage_pattern profile) can be computed using the 
5 cluster center and cluster variation. Cluster center and the cluster variation are the output 
generated by the clustering process performed for generating multiple profiles. 

In one embodiment, switching the device on can be one of the user actions recorded and 
"time of usage" one of the clustering dimension. In this .embodiment, as soon as a user switches 
the device on, the probability for any cluster being active can be determined, even with out any 

10 further user actions. As the user performs more user actions, the current usage pattern record is 
refined to include more user action records and the probability of profiles which may be active is 
refined with the addition of each new user action record. As more user action records are added 
to the current usage pattern, a set of the oldest user action records may be removed from the 
current usage pattern record. This process ensures that the effect of individual user actions in the 

15 identification of active profiles decreases as time passes. 

In one embodiment of the current invention, the clusters generated by the clustering 
procedures are used to create multiple profiles using a method described in Figure 41 . In this 
embodiment, the clusters are used to compute the probability of each clusters being active 
during the period of each usage pattern record. User preferences for each profile are created 

20 using processes described above using the user actions weighted by the probability of the 
corresponding cluster being active. 

In yet another embodiment of the present invention, multiple profiles may be created for 
each user. Each of such multiple profiles may be used to describe the user's viewing preferences 
at various time periods, such as weekday or weekend, or morning or evening. The profiles may 

25 also vary according to" other variables, such as showing more sports during football season, more 
gardening shows during spring, more political commentaries every four years during the 
primaries, etc. Thus, once the system of the invention is activated and the user who activated the 
system is recognized, the system may further determined the time of day and/or week and access 
the appropriate profile for the user. 

30 With reference now to Figure 43, in another preferred embodiment of the present 

invention used to deliver advertising content to specifically targeted viewers based upon their 

individual viewer profile, the broadcast network amasses content from the many different 

content providers and combines it at a network broadcast head end 29 into a broadcast signal 

that is transmitted to, and received by, the television control system 2 of the present invention. 
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The control system 2 sorts through the content received from the broadcast network based upon 
the profiles of the different viewers in the particular household, compiles lists of preferred 
programs available to each user, records and stores programs of interest, and makes the 
programs available for viewing when required. Equally important, the control system 2 receives 
5 and sorts through various advertising content and stores the content most relevant to the viewers 
in the house hold based upon their demographic profiles and the target audience of the 
advertising content. 

With greater particularity, and as shown in Fig. 43, content providers include EPG 
suppliers 18, text channel suppliers 20, conventional programming suppliers 23, and alternative 

10 content suppliers 25. As described below, alternative content refers to highly targeted 

programming that may be delivered through the use of a control system 2 as described herein. 
Conventional programming encompasses all types of programming currently available and 
including, but not limited to, movies, news, sport events, educational programs, pay-per-view 
shows, and shopping channels. Advertising content suppliers 19 provide promotional material 

15 such as short video clips or graphical and text information to programming suppliers 23 for 
inclusion in their conventional programming content, in a manner familiar to TV viewers 
worldwide. However, in a novel method of delivering advertising content uniquely available 
through the use of the present invention, advertising content may be supplied to alternative 
content suppliers (see discussion below) and may also be profiled through a profiler 21 to 

20 develop meta data descriptive of each individual advertising message. As discussed elsewhere in 
the specification, the meta data refers to demographic characteristics that describe the viewers to 
whom each advertisement is specifically targeted. 

The profiler 21 may be a software based system or may rely upon human intervention to 
watch and analyze each advertising message and develop meta data descriptive of the 

25 adyertisement. Alternatively, the advertising content suppliers 19 may provide this information 
along with the advertisement. The meta data is developed based upon the parameters of interest 
to the clients for whom the ads are developed, and will typically include the age, gender, ethnic 
background, education, profession, and other demographic information related to the viewer. 
The meta data will thus describe the viewers to which each advertising message is targeted in 

30 terms of the viewers' demographic information as defined by the parameters mentioned 

previously. The meta data thus generated is then appended to each advertisement and the thus- 
tagged advertisements are provided to the broadcast network for inclusion in the broadcast 
signal. 
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The tagged advertisements will preferably be provided in a plurality of channels, wherein 
each channel may carry a specific type of advertisement in terms of the target audience. Thus, 
each channel may be defined according to one demographic parameter or an often recurring 
combination, e.g. a channel with ads for men, a channel with ads for women, a channel with ads 
5 for male sports fans, etc. Alternatively, each channel may carry a stream of all types of ads, 
wherein each type of ad is present in a ratio dependent upon the demographic parameters to 
which it is targeted. Thus, ads targeted to male sports fans on weekend afternoons will be more 
numerous than ads targeted to young children, because of the large number of sports events 
broadcast at such time and the high probability that young children will be playing outside rather 

10 than watching TV indoors at such times. Although providing tagged advertising content via a 
plurality of channels is not a requirement of the invention, it is believed that the sheer amount of 
advertising content available will require multiple channels as a practical matter. Furthermore, 
providing channels in a pre-sorted manner may reduce the amount and complexity of the sorting 
carried out by the control system 2 (as more fully described below). 

15 Also as shown in Fig. 43, the signal broadcast by the network may further include 

information provided by the administrator of the service provided by the invention, e.g. 
Metabyte Inc., the assignee of this application. As described in more detail elsewhere in the 
application, this information may include updates of the software of the control system 2. 

With continued reference to Fig. 43, the network broadcast signal is transmitted to, and 

20 received by, the control system 2. The broadcast signal may be analog or digitally encoded, and 
may be transmitted via cable, satellite, telephone line, or any other practicable manner. In view 
of the large number of channels typically available and the state of the art, practical 
considerations will most likely dictate that the network signal be broadcast in digital form, in 
which case both the network head end 29 and the control system 2 will need to incorporate ADC 

25 and.DAG circuitry* in a manner very well known to those skilled in the art. Furthermore, it is 
understood that the control system 2 will need to incorporate channel tuning circuitry that, 
although not shown in the figure, will be necessary to separate the numerous signals multiplexed 
in the broadcast signal and select any one of these signals (Le. channels) as necessary. 

Once received by the control system 2, the broadcast signal is supplied to various 

30 components of the system. The program switch 1 14 responds to the viewers input as provided 

via a remote control or similar unit to select the desired channel and direct the channel signal to 

the TV monitor 108. As described previously, the preference agent 110 monitors the viewing 

selection of the various viewers using the control system 2 and creates viewing profiles of each 

viewer that are stored in the preference database 116. Based upon these profiles, the preference 
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agent 110 sorts through the incoming programming content as described in the EPG information 
to compile lists such as "Top 10" lists of viewing choices available at any given time to each 
viewer, and directs the recording manager 1 12 to record the top-ranked program being broadcast 
at any given time (including any programs selected by the viewers for recording) and store it in 
5 the stored programs memory device 35. 

The preference agent further contains software that allows it to create a demographic 
profile for each viewer, based upon the viewing profile of the viewer and certain algorithms or 
associative rules. These algorithms may be adjusted over time as the model employed by the 
system administrator 27 is enhanced and its accuracy improves. To this end, the system update 

10 information channel included in the broadcast signal may include periodic software updates, 

including new preference database parameters that may need to be included at the request of the 
advertising suppliers 1 9. Thus, in one embodiment the control system 2 of the invention may be 
remotely upgraded to meet any new demands that may arise as advertising content providers 
become familiar with the system and the process of custom tailoring narrowly focused, targeted 

15 advertisements. The demographic profile created for each viewer is stored in the demographic 
database 3 1 , which resides in the control system 2 and thus ensures the viewers privacy. 

The preference agent 110 also sorts through the advertising content streaming in through 
multiple advertising channels contained within the broadcast signal and, based upon the 
demographic profiles of the viewers and the meta data contained in each advertisement to 

20 describe the target audience for the particular advertisement, stores and/or causes the display of 
particular advertisements. The control system 2 may utilize any of a variety of methods to 
manipulate the advertising content, as described below. 

In one embodiment of the invention, the advertising channels each carry the same type of 
advertising. The preference agent determines which viewer is watching TV at any given time 

25 arid stores in stored adSiheriidry device 33 those tagged advertisements that are targeted to the 
particular viewer. At appropriate times during the program that is being watched by the viewer, 
such as during the commercial breaks typically inserted in most TV programs, the preference 
agent directs the program source switch 1 14 to access the stored ads and play a selected 
advertisement on the TV monitor 108. If the program being watched by the viewer contains 

30 information regarding the length of the commercial break, the preference agent may select stored 

ads of appropriate length to insert in the allotted time slot. The preference agent may further 

keep track of the ads that have been previously played to ensure that all stored ads are displayed 

equally. Alternatively, tagged ads may contain the desired number of times that the 

advertisement provider wants the ad to be aired during any given day, or perhaps the specific 
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times at which the ad should be shown. Thus, after a number of ads having been stored, e.g. 
sufficient for a 24 hour period, the preference agent may review all of the stored ads meta data 
and develop a strategy for showing these ads to the viewer including when and how often. 

Alternatively, the preference agent may cause a certain number of ads to be displayed 
5 and direct the recording manager 1 12 to record the selected program if the stored ads being 
displayed run longer than the allotted time slot for the commercial break. The control system 2 
of the present invention can therefore manipulate the broadcast schedule of a program to a 
certain extent by modifying the amount of advertising content that the viewer is subjected to. 
In another embodiment, the advertising channels may be operational for only a limited 

10 time during the day, typically at off-peak hours (e.g. 2 a.m.), during which all advertisements for 
the coming day may be downloaded and stored. Thus, an advertising channel may be turned on 
at a certain time and stream all advertisements to the control system during the space of an hour 
or two. Alternatively, a regular programming channel may suspend programming for an hour or 
two at a convenient time and supply the advertisements for the following day. In yet another 

1 5 alternative, a dedicated connection such as a phone line or an internet connection may be used 

* 

for periodic downloads of advertisements. 

In an alternative embodiment, multiple advertising channels carry a mixture of 
advertisements such that at any given time the preference agent has the option ofselecting at 
least one advertisement targeted to the viewer from one of these channels. In this embodiment 

20 the control system need only to store one advertisement at any given time to ensure continuity 
between the program being watched and the advertisements. Thus, by way of example, the 
commercial break in the program may occur at a time that does not correspond to the beginning 
of an advertisement on any of the advertising channels. In this case, the advertisement stored by 
the control system is directed through the program source switch to the TV monitor while 

25 another targeted advertisement is concurrently stored for subsequent display. If the commercial 
break happens to coincide with the start of a targeted advertisement on any of the advertising 
channels, the preference agent can simply cause the program source switch to direct the 
particular channel to the TV monitor while another advertisement from another advertising 
channel is being stored. 

30 In another embodiment, each regular programming channel can carry multiple, 

multiplexed versions of an advertisement. When a commercial break occurs, the preference 
agent selects the most appropriate version of the advertisement and directs it to the program 
source switch. This embodiment would require additional circuitry to de-multiplex the various 
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versions of each add and apply the particular version selected by the preference agent to the 
program source switch. 

The novel method of delivering targeted advertising that is provided by the invention is 
extremely advantageous to the advertising community. In one aspect, the invention allows 
5 greater freedom in producing advertisements because they no longer need to be appealing to as 
wide and varied an audience. Furthermore, because the alternative advertisements are delivered 
in real time, all viewers are reached by individualized advertisements at the same time , thus 
providing significant time savings in comparison with the prior art, wherein five different 
advertisements would require five different time slots during which to be broadcast. The cost 
10 savings thereby realized, especially during very popular events such as the Super Bowl, can be 
significant. 

As mentioned above, preference agent 110 may sort through available channels to select 
the ten (or any other number) programs currently playing that most closely match the viewer's 
viewing profile. In addition, the preference agent may also build customized listings of future 

1 5 programs based upon the viewer's profile as well as any additional criteria specified by the user. 
In a preferred embodiment the user will have the ability to fully customize his viewing profile, 
including the values assigned to the different parameters that make up his or her viewing profile. 
In an alternative embodiment the user may even be aJIowed to specify what kind of advertising 
he prefers. Such a configuration would likely generate an alternative billing arrangement, 

20 whereby the user agrees to watch advertisements in exchange for the ability to specify the types 
of advertising he wishes to be subjected to and perhaps some sort of financial incentive. 

Another novel possibility engendered by the present invention is the creation of a 
'personal channel' that is always showing the most interesting program being broadcast at any 
given time and/or previously recorded and stored programs that were very close matches to the 

25 viewer's profile. Thus, turning on to the personal channel will always guarantee the viewer the 
most individualized, interesting viewing experience available based upon the programs 
broadcast during a certain preceding time period that the viewer may specify. The personal 
channel may thus show a collage of movies, news segments, sports events, and any other 
programming content that was broadcast during the preceding 24 hours and that matched the 

30 viewer's profile to within a specified degree. 

An additional element of control system 2 that may be incorporated only in selected 

control systems is a privacy filter 37 that deletes any personal information from the demographic 

database and transmits this anonymous information to the system administrator 27 for purposes 

of maintaining and updating the model used to generate the demographic database. Such models 
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of the control system will require a feedback line for transmitting the demographic information 
to the system administrator, and in a preferred embodiment is a telephone line or a dedicated 
internet line such as DSL, cable, etc. The system administrator may offer financial or other 
incentives to users to convince them to supply this type of information. By carefully selecting 
5 users across a wide demographic cross section, the system administrator can use the information 
thus gathered to enhance the model used to develop a viewer's demographic profile based upon 
his or her viewer profile by comparing these users* actual demographic data with the 
demographic profile developed by the preference agent. 

Due to the narrowly focused nature of the advertising delivered by the invention, the 

1 0 targeted advertising developed for delivery by the control system of the invention may include 
novel elements such as coupons or highly targeted descriptions. Thus, targeted advertising 
developed for distribution via the present invention may include specific information, couched 
in specific language, that is intended to be especially appealing to the target audience. 
The invention further allows the creation of highly targeted content other than 

15 advertisements that can be delivered only to a very specific audience. Thus, movies, shows, 
religious programs, video magazines, infomercials, etc., may be developed to reach a very 
specific audience without the restrictions typically imposed on the content developers when the 
program will reach, or at least be accessible to, a very wide audience. This embodiment will 
require that such specific content be supplied via dedicated channels that cannot be tuned to 

20 directly by a conventional TV tuner, and thus may only be accessed through the control system 
2. Such highly targeted content may be provided by alternative content suppliers 25 or even be 
developed for alternative distribution by conventional programming suppliers 23. Thus, use of 
the present invention may create a new distribution medium that will allow the content providers 
to not only reach a very specific audience but also to remotely, automatically exclude certain 

25 segments of the audience from accessing this material. Such alternative content could be 
broadcast exclusively on a viewer's personal channel, as described previously. 

The present invention thus enables the delivery of highly targeted content to a viewer 
that includes not only advertisements, but other programming and content. With reference now 
to Fig. 44, a time line 800 is shown tracking the progress of a typical television program 850. In 

30 conventional linear programming as currently available to TV viewers from broadcast as well as 

cable/DSS stations, program 850 is composed of scenes such as Scene 1, Scene 2, etc., and 

advertisements such as Ad 1, Ad 2, etc. In accordance with conventional linear programming, 

the scenes are shown sequentially as predetermined by the producer of the program, and are 

interspersed with showing of the advertisements as determined by the head-end operator. In 
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accordance with the invention, customized linear programming is enabled due to the 
development of demographic and/or viewer preference profiles for each user. Thus, a 
customized linear program 860 may still be comprised of scenes interspersed with 
advertisements, but some or all of the scenes as well as the advertisements may now be targeted 
5 to various demographic or viewing traits of the user. Thus, Scene 2 may actually have been shot 
in two different versions by the producer of the television program, Scene 2 and Scene 2a, 
wherein each version of the scene is more appropriate, or of more interest, or less offensive, to a 
particular target audience. Thus, in one possible embodiment, Scene 2a may be a less violent or 
less graphic version of Scene 2. The system of the present invention receives TV signals 
10 carrying both scenes, and selects among the available alternatives based upon the profile of the 
currently watching user and the target data of the alternative scenes. Program 860 as shown in 
Fig. 44 is thus shown to the user in a linear, seamless manner as comprising Scene 1, alternative 
Scene 2a, alternative Scene 3b, alternative Ad la, alternative Ad 2c, Scene 4, Ad 3, and 
alternative Ad 4b. 

15 The same matching and selection process occurs with Scenes 3, 3a and 3b, as well as the 

advertisements. Thus, Ad 1 may be provided in four variations, each targeting a different 
demographic characteristics, be it the users sex, income, geographic location, or any other 
desired characteristic. As previously mentioned, providing such targeted ads provides a more 
interesting and enjoyable watching experience for the user, and thus enhances the likelihood that 

20 the user will watch the advertisement rather than switch channels or walk away. 

It is important to point out that the method of targeted advertising enabled by the 
invention is different from providing advertising messages by a programming guide. The present 
invention enables showing advertisements in an apparently conventional, linear manner whereby 
the broadcast of a television program is halted periodically by the head-end to broadcast 

25 advertisements. "The advertisements thus shown are targeted to the individual user watching a 

particular TV set, but the user is not prompted in any manner to select between alternative ads or 
in any other way informed of the fact that alternative ads are being received by the set-top box. 
This is a different manner of providing targeted advertisements from providing targeted 
advertisements by a programming guide because, in a conventional programming guide context, 

30 the TV screen is typically split in at least two different areas, wherein one area shows program 
listings and the other area displays advertisements. Thus, in the context of a program guide, the 
user does not receive any programming at all, but rather only program listings and 
advertisements. 
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As shown in Fig. 44, alternate scenes and advertisements are preferably of the same 
length and thus occupy the same time slots within the program time line. However, it also 
possible to provide alternative advertisements and scenes of different lengths, such as those 
comprising television program 870. In this embodiment, the set top box would additionally 
5 fulfill the role of scheduler by calculating the length of each scene and advertisement and 
scheduling the scenes and advertisements for showing to the user in a seamless, apparently 
linear program with no interruptions or overlaps. In this embodiment both the scenes and the 
advertisements may be comprised of any combination of broadcast and previously recorded, 
stored scenes and advertisements. Thus, by way of example, in program 870 alternative Scene 

10 2a may be shorter than alternative Scene 2, which would necessitate showing Scene 3 b and Ad 
la at an earlier time. In this case, an additional Ad A may be shown between Ad la and Ad 2c, 
where Ad A is selected by the set top box because its target profile fits the demographic profile 
of the user and because it is of the appropriate length of time. 

Providing a customized program will also be able to reach a wider audience by catering 

1 5 more individually to specific traits and characteristics of the general viewing population. 

Furthermore, providing such customized content may also allow greater artistic and expressive 
freedom for producers of programming, as they will be able to explore different variations of the 
same story line and edit various scenes for various audiences. Another possible use for the 
customized linear programming enabled by the present invention is providing highly 

20 individualized services such as news, weather, stock market quotes, shopping events, 
instructional videos, etc. 

The popularity and acceptance of the system of the present invention will depend largely 
upon the cost to the end users, i.e. the viewers. As such, in one preferred financial arrangement, 
the user pays a set price for the control system, i.e. the hardware, connects the control system to 

25 his TV and incoming cable or satellite dish line, and enjoys the personalized services provided 
by the control system at no further cost. The system administrator will provide tagged 
advertisements for broadcast by the broadcast network and charge a fee to the advertisement 
client. The actual advertisements may be provided by an advertising content supplier or the 
advertising client itself (e.g. a truck manufacturer), to which the system administrator tags the 

30 meta data. The fee may be based upon the total number of control systems installed and/or the 

target audience that the advertising client wishes to reach. Thus, if a client wishes to air 

advertisements that are tagged to reach a relatively wide audience of viewers who have 

purchased and presumably installed the control system of the invention, the fee charged will be 

proportionally higher than the fee charged for a more narrowly focused advertisement. 
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statutes, those skilled in this art will understand how to make changes and modifications in the 
present invention to meet their specific requirements or conditions. Such changes and 
modifications may be made without departing from the scope and spirit of the invention as set 
5 forth in the following claims. 
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What is claimed is: 

1 . A method to deliver customized linear video programming to each of a plurality of 
individual viewers, comprising: 

processing information indicative of preferences of each of the plurality of viewers to 
develop viewer characteristics information for each of the viewers; and 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, to create an apparently linear program for linear delivery to the viewer in accordance 
with the viewer characteristics information. 

2. The method of claim 1, wherein configuring a set of video programming segments 
further comprises: 

selecting at least one broadcast video programming segment for linear delivery to the 
viewer concurrent with the broadcast of the video segment to create the apparently linear 
program. 

3. The method of claim 1, wherein configuring a set of video programming segments 
further comprises: - 

selecting at least one video programming segment stored on a storage medium for linear 
20 delivery to the viewer to create the apparently linear program. 

4. The method of claim 1, wherein processing information indicative of preferences of each 
of the plurality of viewers comprises: 

processing information indicative of television program viewing preferences of each of 
25 the plurality of viewers. 

5. The method of claim 4, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television programs watched by each of the plurality 
30 of viewers. 

6. The method of claim 4, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 
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processing information indicative of television programs recorded by each of the 
plurality of viewers. 

7. The method of claim 4, wherein processing information indicative of television program 
5 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television programs not watched by each of the 
plurality of viewers. 

8. The method of claim 4, wherein processing information indicative of television program 
10 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television program guide information requested by 
each of the plurality of viewers. 

9. The method of claim 4, wherein processing information indicative of television program 
1 5 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television program guide information not requested 
by each of the plurality of viewers. 

10. The method of claims 5, 6, 7, 8, or 9, wherein processing information comprises: 
20 processing electronic program guide information. 

1 1 . The method of claim 1 , wherein processing information indicative of preferences of each 
of the plurality of viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers 
25 provided by each of the viewers in response to queries. 

12. The method of claim 1, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of preferences other than television program viewing . 
30 preferences of each of the plurality of viewers. 

13. The method of claim 12, wherein processing information indicative of preferences other 
than television program viewing preferences of each of the plurality of viewers comprises: 

-60- 

BNSDOCiD: <WO 01 1725QA1_I_> 



WO 01/17250 



PCT/US00/23868 



processing information indicative of musical preferences of each of the plurality of 
viewers. 

14. The method of claim 12, wherein processing information indicative of preferences other 
5 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of reading preferences of each of the plurality of 
viewers. 

15. The method of claim 12, wherein processing information indicative of preferences other 
1 0 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of shopping preferences of each of the plurality of 
viewers. 

16. The method of claim 12, wherein processing information indicative of preferences other 
1 5 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of preferences other than television program viewing 
preferences of each of the plurality of viewers acquired from the group of sources comprising 
on-line music clubs, on-line book clubs, on-line special interest clubs and organizations, and on- 
line retailers and merchants. 

20 

1 7. The method of claim 1 , 4, 1 1 , or 12, wherein processing information indicative of 
preferences of each of the plurality of viewers to develop viewer characteristics information for 
each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
25 develop a television pirogram viewing preference profile for each of the viewers. 

18. The method of claim 17, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers comprises: 

30 processing information indicative of preferences of each of the plurality of viewers in 

accordance with a predictive model based on the viewing habits of a representative sample of 
the general population to develop a television program viewing preference profile for each of the 
viewers. 
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19. The method of claim 18, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a probabilistic model based on the 
viewing habits of a sample of the general population comprises: 

constructing a Bayesian network to calculate maximum a posteriori values for the 
5 parameters of the predictive model to predict television program viewing preferences for each 
viewer. 

20. The method of claim 17, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a television program viewing preference profile for 

10 each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
deduce hidden traits of each viewer. 

21 . The method of claim 17, wherein processing information indicative of preferences of 
15 each of the plurality of viewers to develop a television program viewing preference profile for 

each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
deduce associated traits of each viewer. 

20 22. The method of claim 17, further comprising: 

processing the television program viewing preference profile developed for each of the 
viewers to develop demographic information for each of the viewers. 

23 . The method of claim 22, wherein processing the television program viewing preference 
25 profile developed for each of the viewers to develop demo graphic information for each of the 

viewers comprises: 

processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model based on the viewing habits of a representative 
sample of the general population to develop demographic information for each of the viewers. 

30 

24. The method of claim 23, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a predictive model based on the 
viewing habits of a representative sample of the general population comprises: 
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processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model that predicts each demographic trait of the viewer 
based on viewer preferences for television programs that attract a higher proportion of viewers 
exhibiting the demographic trait than is exhibited by the representative sample.of the general 
5 population. 

25. The method of claim 24, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a predictive model that predicts 
each demographic trait of the viewer based on viewer preferences for television programs that 

10 attract a higher proportion of viewers exhibiting the demographic trait than is exhibited by the 
representative sample of the general population further comprises: 

processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model that predicts each demographic trait of the viewer 
based on viewer preferences for television programs that attract a higher proportion of viewers 

1 5 exhibiting the demographic trait than is exhibited by the representative sample of the general 

population and that exhibit minimal demographic trait correlation with other television programs 
that attract a higher proportion of viewers exhibiting other demographic traits than is exhibited 
by the representative sample of the general population. 

20 26. The method of claim 1, wherein processing information indicative of preferences of each 
of the plurality of viewers to develop viewer characteristics information for each of the viewers 
comprises: 

processing information indicative of preferences of each of the plurality of viewers to detfeiop 
demographic information for each of the viewers. 

25 V - ; " - " ■ ' : 

27. The method of claim 26, wherein processing information indicative of preferences of 

develop 

each of the plurality of viewers to Y demographic information for each of the viewers comprises: 

processing information indicative of preferences of each of the viewers in accordance 
with a predictive model based on the viewing habits of a representative sample of the general 
30 population to develop demographic information for each of the viewers. 

28. The method of claim 27, wherein processing information indicative of preferences of 
each of the viewers in accordance with a predictive model based on the viewing habits of a 

representative sample of the general population comprises: 
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processing information indicative of preferences of each of the viewers in accordance 
with a predictive model that predicts each demographic trait of the viewer based on viewer 
preferences for television programs that attract a higher proportion of viewers exhibiting the 
demographic trait than is exhibited by the representative sample of the general population. 

5 

29. The method of claim 28, wherein processing information indicative of preferences of 
each of the viewers in accordance with a predictive model that predicts each demographic trait 
of the viewer based on viewer preferences for television programs that attract a higher 
proportion of viewers exhibiting the demographic trait than is exhibited by the representative 

10 sample of the general population further comprises: 

processing information indicative of preferences of each of the viewers in accordance 
with a predictive model that predicts each demographic trait of the viewer based on viewer 
preferences for television programs that attract a higher proportion of viewers exhibiting the 
demographic trait than is exhibited by the representative sample of the general population and 

1 5 that exhibit minimal demographic trait correlation with other television programs that attract a * 
higher proportion of viewers exhibiting other demographic traits than is exhibited by the 
representative sample of the general population. 

30. The method of claim 26, wherein processing information indicative of preferences of 
20 each of the plurality of viewers tc^demographic information for each of the viewers further 

comprises: deif£kp 

developing a television program viewing preference profile for each of the viewers in 
accordance with the viewer demographic information. 

25 31. The method of claim 1 7 or 30, wherein processing information indicative of preferences 
of each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers further comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
develop a plurality of television program viewing preference profiles for each of the viewers. 

30 

32. The method of claim 31, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a plurality of television program viewing preference 
profiles for each of the viewers comprises: 
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processing information indicative of preferences of each of the plurality of viewers to 
develop a plurality of television program viewing preference profiles for each of the viewers, 
each profile describing the television program viewing preferences of the viewer at a different 
time of day, time of week, or season. 

5 

33. The method of claim 17 or 30, wherein processing information indicative of preferences 
of each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers further comprises: 

processing information indicative of preferences of each of a plurality of viewers 
10 accessing the same television equipment to develop a plurality of television program viewing 
preference profiles for each of the viewers. 

34. The method of claim 33, wherein configuring a set of video programming segments 
comprises: 

1 5 configuring a set of video programming segments for the viewers accessing the same * 

television equipment, at least one of the video programming segments selected from a plurality 
of available video programming segments, to create an apparently linear program for linear 
delivery to the viewers in accordance with characteristics information of the viewers. 

20 35. The method of claims 1 7 or 30, wherein configuring a set of video programming 
segments for each viewer comprises: 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, for recording in accordance with the television program viewing preference profile of 

25 the viewer. 

36. The method of claim 1 , wherein configuring a set of video programming segments 
comprises: 

configuring a set of video programming segments selected from the group of video 
30 programming segments comprising advertising, entertainment, news, weather, financial, sports, 
educational, and shopping programming. 

37. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: 
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configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, for recording in accordance with the viewer characteristics information. 

5 38. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: " 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, to create an apparently linear program for linear delivery to the viewer on at least one 
10 dedicated channel in accordance with the viewer characteristics information. 

39. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: 

presenting a listing of the set of video programming segments to the user for the user to 
1 5 select therebetween. 

* 

40. The method of claim 1 , wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
20 video programming segments to create an apparently linear program for linear delivery to the 

viewer, the program exhibiting content customized in accordance with the viewer characteristics 
information. 

4 1 . The method of claim 1 , wherein configuring a set of video programming segments 
25 further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer characteristics information. 

30 42. The method of claim 41, wherein selecting one or more of the video programming 
segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
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viewer, the program exhibiting content targeted to the viewer characteristics information by the 
providers of the video programming segments. 

43. The method of claim 1 7, wherein configuring a set of video programming segments 
5 further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content customized in accordance with the viewer television 
program viewing preference profile. 

10 

44. The method of claim 17, wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
1 5 viewer, the program exhibiting content targeted to the viewer television program viewing 
preference profile. 

45. The method of claim 44, wherein selecting one or more of the video programming 
segments comprises: 

20 selecting one or more of the video programming segments from a plurality of available 

video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer television program viewing 
preference profile by the providers of the video programming segments. 

25 46. The method of claim 45, wherein selecting one or more of the video programming 
- segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
advertising video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
30 television program viewing preference profile by the providers of the advertising video 
programming segments. 

47. The method of claim 26, wherein configuring a set of video programming segments 
further comprises: 

-67- 



BNSDOCID: <WO 01 17250A1_I_> 



WO 01/17250 



PCT/US00/23868 



selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content customized in accordance with the viewer demographic 
information. 

5 

48. The method of claim 26, wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
10 viewer, the program exhibiting content targeted to the viewer demographic information. 

49. The method of claim 48, wherein selecting one or more of the video programming 
segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
15 video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer demographic information by the 
providers of the video programming segments. 

50. The method of claim 49, wherein selecting one or more of the video programming 
20 segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
advertising video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
demographic information by the providers of the advertising video programming segments. 
25 : - 

51 . The method of claims 40, 41 , 43, or 47, wherein configuring a set of video programming 
segments comprises: 

configuring a set of video programming segments selected from the group of video 
programming segments comprising advertising, entertainment, news, weather, financial, sports, 
30 educational, and shopping programming. 
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52. A system to deliver customized linear video programming to each of a plurality of 
individual viewers, comprising: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to develop viewer characteristics information for each of the viewers; and 
5 a programmer to configure a set of video programming segments for each viewer, at least 

one of the video programming segments selected from a plurality of available video 
programming segments, to create an apparently linear program for linear delivery to the viewer 
in accordance with the viewer characteristics information. 

10 53. The system of claim 52, wherein the programmer further comprises: 

a programmer to select at least one broadcast video programming segment for linear 
delivery to the viewer concurrent with the broadcast of the video segment to create the 
apparently linear program. 

1 5 54. The system of claim 52, wherein the programmer further comprises: 

a programmer to select at least one video programming segment stored on a storage 
medium for linear delivery to the viewer to create the apparently linear program. 

55. The system of claim 52, wherein the processor comprises: 

20 a processor to process information indicative of television program viewing preferences 

of each of the plurality of viewers. 

56. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television programs watched by each of 
25 the plurality of viewers. 

57. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television programs recorded by each of 
the plurality of viewers. 

30 

58. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television programs not watched by each 
of the plurality of viewers. 
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59. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television program guide information 
requested by each of the plurality of viewers. 

5 60. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television program guide information not 
requested by each of the plurality of viewers. 

61 . The system of claims 56, 57, 58, 59, or 60, wherein the processor comprises: 
10 a processor to process electronic program guide information. 

62. The system of claim 52, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers provided by each of the viewers in response to queries. 

15 

» 

63. The system of claim 52, wherein the processor comprises: 

a processor to process information indicative of preferences other than television 
program viewing preferences of each of the plurality of viewers. 

20 64. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of musical preferences of each of the 
plurality of viewers. 

65. The system of claim 63, wherein the processor comprises: 

25 a processor to process information indicative of reading preferences of each of the 

plurality of viewers. 

66. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of shopping preferences of each of the 
30 plurality of viewers. 

67. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of preferences other than television 

program viewing preferences of each of the plurality of viewers acquired from the group of 
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sources comprising on-line music clubs, on-line book clubs, on-line special interest clubs and 
organizations, and on-line retailers and merchants. 

68. The system of claim 52, 55, 62, or 63, wherein the processor comprises: 

5 a processor to process information indicative of preferences of each of the plurality of 

viewers to develop a television program viewing preference profile for each of the viewers. 

69. The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
10 viewers in accordance with a predictive model based on the viewing habits of a representative 
sample of the general population to develop a television program viewing preference profile for 
each of the viewers. 

70. The system of claim 69, wherein the processor further comprises: 

15 a Bayesian network to calculate maximum a posteriori values for the parameters of the 

predictive model to predict television program viewing preferences for each viewer. 

71 . The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
20 viewers to deduce hidden traits of each viewer. 



72. The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to deduce associated traits of each viewer. 

25 

73. The system of claim 68, further comprising: 

a processor to process the television program viewing preference profile developed for 
each of the viewers to develop demographic information for each of the viewers. 

30 74. The system of claim 73, wherein the processor comprises: 

a processor to process the television program viewing preference profile developed for 
each of the viewers in accordance with a predictive model based on the viewing habits of a 
representative sample of the general population to develop demographic information for each of 
the viewers. 
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75. The system of claim 74, wherein the processor comprises: 

a processor to process the television program viewing preference profile developed for 
each of the viewers in accordance with a predictive model that predicts each demographic trait 
5 of the viewer based on viewer preferences for television programs that attract a higher 

proportion of viewers exhibiting the demographic trait than is exhibited by the representative 
sample of the general population. 

76. The system of claim 75, wherein the processor comprises: 

10 a processor to process the television program viewing preference profile developed for 

each of the viewers in accordance with a predictive model that predicts each demographic trait 
of the viewer based on viewer preferences for television programs that attract a higher 
proportion of viewers exhibiting the demographic trait than is exhibited by the representative 
sample of the general population and that exhibit minimal demographic trait correlation with 

15 other television programs that attract a higher proportion of viewers exhibiting other 

demographic traits than is exhibited by the representative sample of the general population. 

77. The system of claim 52, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
20 viewers tt^demographic information for each of the viewers. 

develop 

78. The system of claim 77, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the viewers in 
accordance with a predictive model based on the viewing habits of a representative sample of 
25 the general population to develop demographic information for each of the viewers. 

79. The system of claim 78, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the viewers in 
accordance with a predictive model that predicts each demographic trait of the viewer based on 
30 viewer preferences for television programs that attract a higher proportion of viewers exhibiting 
the demographic trait than is exhibited by the representative sample of the general population. 

80. The system of claim 79, wherein the processor comprises: 
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a processor to process information indicative of preferences of each of the viewers in 
accordance with a predictive model that predicts each demographic trait of the viewer based on 
viewer preferences for television programs that attract a higher proportion of viewers exhibiting 
the demographic trait than is exhibited by the representative sample of the general population 
5 and that exhibit minimal demographic trait correlation with other television programs that attract 
a higher proportion of viewers exhibiting other demographic traits than is exhibited by the 
representative sample of the general population. 

8 1 . The system of claim 77, wherein the processor comprises: 

10 a processor to develop a television program viewing preference profile for each of the 

viewers in accordance with the viewer demographic information. 

82. The system of claim 68 or 81 , wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
1 5 viewers to develop a plurality of television program viewing preference profiles for each of the ' 
viewers. 

83. The system of claim 82, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
20 viewers to develop a plurality of television program viewing preference profiles for each of the 
viewers, each profile describing the television program viewing preferences of the viewer at a 
different time of day, time of week, or season. 

84. The system of claim 68 or 8 1 , wherein the processor comprises: 

25 a processor to process information indicative of preferences of each of a plurality of 

viewers accessing the same television equipment to develop a plurality of television program 
viewing preference profiles for each of the viewers. 

85. The system of claim 84, wherein the programmer comprises: 

30 a programmer to configure a set of video programming segments for the viewers 

accessing the same television equipment, at least one of the video programming segments 
selected from a plurality of available video programming segments, to create an apparently 
linear program for linear delivery to the viewers in accordance with characteristics information 
of the viewers. 
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86. The system of claims 68 or 81, wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
one of the video programming segments selected from a plurality of available video 
5 programming segments, for recording in accordance with the television program viewing 
preference profile of the viewer. 

87. The system of claim 52, wherein the programmer comprises: 

a programmer to configure a set of video programming segments selected from the group 
10 of video programming segments comprising advertising, entertainment, news, weather, 
financial, sports, educational, and shopping programming. 

88. The system of claim 87, wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
15 one of the video programming segments selected from a plurality of available video 

programming segments, for recording in accordance with the viewer characteristics information. 

89. The system of claim 87, wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
20 one of the video programming segments selected from a plurality of available video 

programming segments, to create an apparently linear program for linear delivery to the viewer 
on at least one dedicated channel in accordance with the viewer characteristics information. 

90. The system of claim 87, wherein the programmer comprises: 

25 a programmer to present a listing of the set of video programming segments to the user 

for the user to select therebetween. 

91 . The system of claim 52, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
30 of available video programming segments to create an apparently linear program for linear 

delivery to the Viewer, the program exhibiting content customized in accordance with the viewer 
characteristics information. 

92. The system of claim 52, wherein the programmer comprises: 
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a progranimer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer characteristics 
information. 

93. The system-of claim 92, wherein the programmer comprises: 
a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer characteristics 
information by the providers of the video programming segments. 

94. The system of claim 68, wherein the programmer comprises: 
a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
1 5 delivery to the viewer, the program exhibiting content customized in accordance with the viewer 
television program viewing preference profile. 

95. The system of claim 68, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
20 of available video programming segments to create an apparently linear program for linear 

delivery to the viewer, the program exhibiting content targeted to the viewer television program 
viewing preference profile. 

96. The system of claim 95, wherein the programmer comprises: 

25 a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer television program 
viewing preference profile by the providers of the video programming segments. 

30 97. The system of claim 96, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available advertising video programming segments to create an apparently linear program for 
linear delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
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television program viewing preference profile by the providers of the advertising video 
programming segments. 

98. The system of claim 77, wherein the programmer comprises: 
5 a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content customized iri accordance with the viewer 
demographic information. 

10 99. The system of claim 77, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer demographic 
information. 

15 

» 

100. The system of claim 99, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer demographic 
20 information by the providers of the video programming segments. 

101 . The system of claim 100, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available advertising video programming segments to create an apparently linear program for 
25 linear delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
demographic information by the providers of the advertising video programming segments. 

102. The system of claims 91 , 92, 94, or 98, wherein the programmer comprises: 

a programmer to configure a set of video programming segments selected from the group 
30 of video programming segments comprising advertising, entertainment, news, weather, 
financial, sports, educational, and shopping programming. 
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Examples of Program Information 



Title = Seiniieid 
Program Type = Sitcom 
Category = Comedy 
Actors = ( Actorl , Actor2) 



7 



Title - US Debt Report 
Program Type - News article 
Category = US Govt. Financial 
People Mentioned - ( Bill Clinton, 
Alan Greenspan) 



Example 1 



Example 2 



Figure 3 
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Examples for Liking for viewtr N 



Movie 
Adventure 
Sports 
Mad About You 
dynamic trait I 
Dvnamic trait 2 
NBC NEWS 
FRIDAY Movie 
Premier Mad About You 



Movie-, 1 - 
Adventure - i 
Sports ~ OJ 
Mad About You- 5 
dynamic trait 1 * 3 
Dynamic trait 2-5 
NBC NEWS - 13 
FRIDAY Movie - 18 
Premier Mad About You - 15 
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METHOD AND APPARATUS FOR DELIVERY OF 
TARGETED VIDEO PROGRAMMING 

FIELD OF THE INVENTION 

5 This invention relates generally to the delivery of targeted video programming to a 

viewer, and more particularly to the determination of various viewer characteristics for the linear 
delivery of video programming targeted to the viewer's characteristics to create a targeted or 
customized, apparently linear television program. 

10 BACKGROUND OF THE INVENTION 

Currently, recording of television programs by individuals for viewing at a later time is 
generally performed using commercially available Video Cassette Recorders (VCRs). Typically, 
a VCR may be either manually placed into a record mode or may be programmed to record a 
selected program at a later time. To program the VCR, the user either enters a date, time and 
1 5 channel of the program desired to be recorded, or enters an identification code of the desired 
program. 

Viewers of television programming increasingly have more choices as to which 
programs to view. For example, cable television provides a dramatic increase in the number of 
channels available to a viewer in comparison to the channels available by way of a conventional 

20 television antenna. Digital satellite systems provide even more viewing choices. Digital 

broadcast of programs over cable television systems is expected to further increase the number 
of channels available to viewers. 

One effect of the increase in the number of viewing choices is increased difficulty in 
deciding which programs to watch. People, particularly those with busy schedules, may not have 

25 the time to select and view programs to determine which programs they may or may not like. 
Programs that may otherwise be desirable to a viewer may never be watched if the program is 
broadcast at a time that is inconvenient for the viewer. Users may select certain programs for 
viewing to determine if they like the program. However, with several hundred program 
selections each week, this task can take a considerable amount of time and is likely to cause 

30 certain desirable programs to be overlooked. 

There are companies, who sample television-viewing habits of the population by 
monitoring the programs watched by a very small set of TV viewers. These companies collect 
other demographic information also about the people they monitor for the generation of the 
sample. These samples give valuable information about the television viewing habits of the 
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population covered by the sample. By analyzing these samples periodically a general 
mathematical model can be formed which explains the behavioral patterns of the population in 
television viewing. Each individual viewer would have a very personal liking for television 
programs which.may vary considerably from a model derived from the sample behaviors of 
5 general population. At the same time a mathematical model derived by monitoring the behavior 
of a single user may be inaccurate because of the limited amount of information which can be 
gathered by watching only a single user for a short period of time. Sending the entire sample of 
behavioral pattern of the sample population to every viewer device to aid computation of the 
mathematical model is counter-productive because of the high cost of bandwidth required to 

10 transmit this information to each device and the processing power and memory requirement for 
the viewer device to process this information. Sending personal viewing habits of every user to a 
server to compute the mathematical model for the individual user would raise privacy concerns 
for the viewer and also requires a return channel from the viewer device to the server. 
With a mechanism to automatically determine personal preferences of a viewer 

1 5 accurately, a very personal TV viewing environment can be presented to the viewer. In case of 
households with multiple members, by correctly identifying individual members and their 
preferences, an apparatus can provide an entertainment experience which is most pleasurable to 
the individual viewer. 

Methods have been developed for providing text data to viewers. A closed captioned 

20 encoding technique transmits text data in synchronization with its associated video data by 
inserting the closed captioned text data into a vertical blanking interval of the video signal. 
However, the closed captioned text data must be inserted into the vertical blanking interval of 
the video signal by the producer of the video programming. As a result, the vertical blanking 
interval of the video signal cannot be used by the head end operator to insert other text data such 

25 as sports, weather, stock market, news, advertising and other data. 

Electronic program guides (hereafter "EPG") provides viewers with on-screen listings of 
upcoming television programs on cable television channels. The EPG is provided by an EPG 
data service. EPG data is converted into a video signed at the cable head. The EPG data is 
converted into a video signal at the cable head and transmitted to the viewer's television by a 

30 dedicated cable television channel. After tuning to the dedicated cable television channel, the 

viewer then wait for the programming for the desired time period is displayed. Often, when EPG 
data is used, the cable head end operator must dedicate a separate cable television channel to the 
EPG data and create video signals from the EPG data that are provided by the EPG service 
providers. 
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One method of solving this problem is modulation of the EPG data onto an FM carrier 
and transmitting the FM carrier with a video signal on one of the cable television channels. A 
dedicated peripheral device is provided at the viewer's television tuner that demodulates the EPG 
data from the FM carrier. The EPG data is then stored until the viewer requests presentation of 
5 the EPG data on the viewer's television. Upon selection the EPG data is then displayed on the 
viewer's television in place of the other video programming. 

A data controller is disclosed in U.S. Patent No. 5,579,055 that manages the flow of text 
data and electronic EPG data. 

Additionally, preference agents for television programs have used Bayesian methods. 
1 0 See for example U.S. Patent No. 5,704,0 1 7 (hereafter the '"017 Patent"). However, in the '0 1 7 
PatenL a collaborative filtering system is used to predict a desired preference of a television 
viewer based on attributes of the viewer. A system implementing '017 would need to 
communicate to a belief network through a two way communication network, disclosing private 
viewer information to the network. '017 does not leverage the rich EPG information available 
1 5 about television programs which can be used to identify various traits which contribute to 
viewer's choices. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a method and apparatus to select and 
deliver video data targeted to a specific viewer, typically a television viewer. The video data 

20 may be targeted in accordance with various characteristics of the viewer, including viewing 
characteristics, demographic characteristics, shopping characteristics, and others. 

Accordingly, an object of the present invention is to provide a method and apparatus for 
determining user preferences for television viewers. The present invention defines a method to 
analyze sample viewing habits along with associated demographic and other characteristics of 

25 the general population to generate parameters of a mathematical model that explains the 

relationship between the viewing patterns of the population and the associated characteristics, 
and that can be communicated to viewer devices using a one way communication channel such 
as a broadcast network. The present invention also defines methods for using these parameters 
by viewer devices to compute the user preferences of individual viewers without divulging 

30 private information about viewing habits to the outside world. 

The invention also details an apparatus that relies upon a viewer's computed preferences 
to store and present broadcast content that may be of interest to one or more individual viewers 
utilizing the same apparatus to view video data. Thus, the invention also discloses a method to 
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compute the preferences of multiple viewers accessing the same viewing device, e.g. a multiple 
viewer household with a single television set. 

In another aspect, the present invention determines viewing preferences of a user by 
monitoring programs viewed by the user and causes recording of programs corresponding to the 
5 user's preferences. In accordance with the principles of the present invention, apparatus for 

causing recordation of television programs comprises a preference agent for causing retrieval of 
attribute information corresponding to each television program viewed by a user of the 
apparatus. The preference agent generates classification information indicative of viewing 
preferences of the user as a function of the attribute information. A recording manager causes 

1 0 recordation and storage to a storage device of television programs having attribute information 
that matches the classification information. 

In a further aspect, the invention determines viewing preferences of a user by processing 
attribute information associated with programs viewed by the user and creating a profile of 
characteristics describing the viewer. The characteristics may include, but are not limited to, 

1 5 viewing preferences, demographic information (e.g. age, sex, education, occupation, income, 
political and religious affiliations, marital status, sexual preferences, ethnic background, 
geographical location of home and work), reading preferences, shopping preferences, music 
preferences 

Embodiments employing the principles of the present invention advantageously cause 

20 recordation of programs that match certain viewing habits of the viewer. Such embodiments 

therefore provide the viewer with stored programs that match certain viewing preferences of the 

user, which can be viewed at the viewer's leisure. The viewer is therefore relieved of the burden 

of deciding which programs from among several hundred possible programs to watch. 

In accordance with a further aspect of the present invention, programs may be recorded 

25 for storage in accordance with available capacity of the storage device. Moreover, programs 

may be deleted in response to selections by the user or based upon a priority, indicated by 

viewing preferences of the user, in which programs having lowest priority are deleted first to 

make room for newly recorded programs. The priority of programs may also be a function of 

time, in which more recently recorded programs are given higher priority. 

30 In accordance with further aspects of the invention, determining which programs to 

record may also be a function of priority in which program s specified for recordation are given 

highest priority, followed by programs having attribute information corresponding to one or 

more user specified criteria, then followed by programs having attribute information 

corresponding to the recordation preference information. 
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In accordance with further aspects of the invention, the user specified requests may be in 
the form of a first type of request comprising information indicative of a specific program and a 
second type of request comprising specifications indicative of one or more programs having 
attribute information corresponding to the user's specifications. 
5 In accordance with further aspects of the invention, the user may cause recordation of a 

currently broadcast program being viewed by the user by causing generation of a pause input. 
This advantageously allows a user to interrupt viewing of a currently broadcast program by 
recording the remainder of the program for subsequent viewing. Program viewing options may 
be presented to the user in the form of a menu that provides an easy to use interface for selection 
1 0 of programs and viewing and other options including play, pause, delete, fast-forward, rewind 
and so forth. 

Preferably, the preference agent organizes the recordation preference information in the 
form of a database organized in accordance with categorization parameters. Programs may be 
received On either analog or digital formats. Programs stored in digital format are 
1 5 advantageously presented to the user in the form of additional channels. This allows the user to 
easily switch between programs (either recorded or broadcast) simply by switching channels. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a high-level block diagram of a Program selection device employing the 
20 principles of the present invention. 

Figure 2 is a high-level block diagram of a system employing the principles of the 
present invention. 

Figure 3 shows two examples for Program information. 
Figure 4 shows examples for traits and Liking values. 
25 Figures 5a-b are flowcharts illustrating the Data analysis performed on a representative 

sample. 

Figure 6 illustrate the process of computing the error in prediction of user choices 
Figure 7 illustrate a step in the process of regression analysis 
Figure 8 shows the relationship between two correlated traits. 
30 Figures 9a-c illustrate the process of determining trait-ness of a trait in a program. 

Figure 10 is a block diagram-showing an example for the Liking distribution record 

format. 

Figure 1 1 lists some sample values for different fields described in Figure 10. 

Figure 12 is a block diagram showing an example for the trait-ness record format. 
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Figure 13 shows an example for broadcasting trait-ness as a part of EPG data 
Figure 14 shows an example for user selection record format. 

Figure 15 is a block diagram showing the inputs and output for the generation of user 
selection history- 

5 Figure 16 is a flowchart illustrating the process of learning the liking values, performed 

on a program selection device. 

Figure 1 7(a) is a block diagram showing the inputs and output for the computation of 
relevancy value for selection history record. 

Figure 17(b) is a graph representative of relevancy over age. 
1 0 Figure 1 7(c) is a graph representative of repetition rate over relevancy. 

Figure 1 8(a) is a block diagram showing the inputs and output for the process of 
updating of past selection history 

Figure 1 8(b) is a flowchart illustrating the process of updating of past selection history 
Figure 19 is a flowchart illustrating the process of computing liking values performed on 
1 5 a program selection device. 

Figure 20 is a block diagram showing the inputs and output for the computation Liking 
for programs. 

Figure 21a illustrates distribution of income for different programs. 

Figure 21b illustrates distribution of gender for different programs. 
20 Figure 22 is a system architecture for providing targeted advertising. 

Figure 23a is a graph illustrating the relationship between Belief function and probability 
for a user belonging to particular demographic trait value. 

Figure 23b is a flow chart of a demographic trait record format. 

Figure 23 c is a flow chart of an advertisement targeting record format. 
25 Figures 24 and 25 are block diagrams illustrating operation of certain functions 

performed by the television recording system of Figure 1 . 

Figures 26(a)-b illustrate alternative hardware configurations in systems embodying the 
principles of the present invention. 

Figure 27 is a flowchart illustrating additional aspects of operation of the television 
30 recording system of Figure 1. 

Figure 28 is a block diagram of one embodiment of a system for providing EPG data and 
text data to a viewer. 
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Figure 29 is a block diagram that illustrates one embodiment of a data controller for 
receiving EPG data and text from data providers, formatting the data for display and inserting 
the data into a vertical blanking interval of a cable television channel. 

Figure 3t> illustrates one embodiment of an information field of the ERG data read from 
5 the EPG database of Figure 29. 

Figure 3 1 illustrates a data format of data read from the database for insertion into an 
assigned cable television channel. 

Figure 32 is a flow chart illustrating the operation of the EPG transaction formatter of 
Figure 29. 

1 0 Figure 33 is a flow chart illustrating one embodiment of the operation of the text 

transaction formatters of Figure 29. 

Figure 34 illustrates one embodiment of a set top box for use in receiving text data and 
EPG data. 

Figures 35-41 illustrate various aspects of the process of creating and using multiple 
1 5 viewer profiles according to the present invention. 

Figure 42 illustrates a method for distributing targeted electronic content without 
compromising the privacy of users. 

Figure 43 illustrates an embodiment of a system according to the present invention. 
Figure 44 illustrates custom linear programming according to the present invention. 

20 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In Figure 1, a television control system 100 operates in accordance with the principles of 
the present invention to cause recordation of television programs in response to user inputs 102 
and television signals 104. Television control system 100 transmits signals to a television 

25 monitor 108 for viewing by the user. Preferably, in digital embodiments, programs that are 

recorded by system 100 are presented to the user in the form of additional channels. Thus, the 
user can rapidly determine, by changing channels, the stored programs that are available for 
viewing. The user can also change channels between stored programs or between stored 
programs and currently broadcast programs. If the user changes channels from a recorded 

30 program to another program, playback of the recorded program is preferably paused. 

Alternatively, whether the playback of the recorded program is paused or continued, is a user 

selectable option. As described further herein, the user may specify programs for recordation by 

specification of a particular program, or by specification of particular attributes of the program 

such as comedy/drama, actor(s). When manually specifying programs for recordation, the user 
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may specify that the program is to be recorded once or repeatedly, such as weekly, when 
broadcast. 

Signals 104 include a first component 105 which contains the information necessary to 
display video and audio components of a television program on television monitor 108. Signals 
5 104 preferably also include a second component 107 termed herein "attribute information." An 
example of such attribute information 107 is the information available by way of the DVB-SI 
and ATSC-SI formats and various proprietary formats such as StarSight EPG Data and TVData 
available from StarSight Telecast, Inc., Fremont, CA, and TVData, Glen Falls, NY, respectively. 

Attribute information 107 for any particular program varies depending on the program 
1 0 type, but typically includes a plurality of categories such as start time for the program, duration 
of the program, the title of the program and other, attributes (categories) of the program, together 
with an associated value corresponding to each of the categories. Preference agent 110 processes 
the attribute information 107 to generate if "category- value" pairs 115. For example, if an 
attribute for a program is duration, then the category may be duration and the value for that 
1 5 category may be 120 minutes. If the attribute for a program is title, then the category may be * 
title and the value may be "Star Wars." Other category-value pairs for a movie may include a 
description category with a short description of the movie being the value, a primary actor 
category with the names of the primary stars of the movie being the values, a director category 
with the name of the director being the value, a theme category with the theme such as 
20 adventure, comedy being the value, and a ratings category with ratings by particular critics being 
the value. Category- value pairs for a sports game, such as a football game, may include names 
of the teams who are playing, the location of the game, and the specific tournament, such as the 
play-offs, or Superbowl, etc. 

The category-value pairs 115 (preference information) are indicative of viewing 
25 preferences of the user. The data shown in Figure 1 as being associated with the category - 

value pairs 115 contains weighting information for the associated category value, in addition to 
other information shown by way of example further below. Preference agent 110 maintains the 
preference information 1 15 in the form of a preference database 1 16. Television programs 105 
recorded by the system 100 are preferably stored separately together with the associated attribute 
30 information 1 07. In an alternative embodiment, the category value pairs 1 1 5 (with or without 
the associated values) are stored with the television programs 105 and the raw attribute 
information 107 is not maintained by the system 100. 

Preference agent 110 generates, in response to user viewing habits, data for each 

category stored in preference database 1 16 and for each value of each category. The data 
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generated by preference agent 1 10 for each category and value is preferably indicative of the 
amount of time that the particular category and/or value is watched by the user relative to the 
total amount of time that the particular category and/or value is available to be watched. The 
relative amount of time that a program is watched by a user is a convenient indication of the 
5 user's relative viewing preference. However, other indications of user viewing preferences may 
also be used. Program source switch 1 14 operates in response to user inputs 102 to select either 
presently broadcast programs, by way of television signal 104 or stored programs from storage 
devices 106. 

Recording manager 1 12 operates to cause recordation and storage of television programs 
10 105 and attribute information 107 in accordance with information generated by preference agent 
1 10 and stored in preference database 1 16. Recording manager 1 12 also responds to user 
requests to record particular programs and to user requests to record programs having specified 
category- value pairs. 

The signals transmitted to the monitor 108 preferably take a conventional analog form. 

15 Alternatively the signals transmitted to the monitor 108 maybe digitally encoded. The exact 

form of the signals transmitted to the monitor is not critical and may take a form as required by a 
particular monitor. The television signals 104 received by the television control system 100 may 
take one of a variety of signal formats including analog encoded signals that are encoded in 
accordance with the well known NTSC or PAL standards. Alternatively, the signals 104 may be 

20 digitally encoded in a manner as transmitted by commercially available digital satellite systems 
(DSS) or in accordance with the MPEG-2 (Motion Picture Expert Group - 2) standard. In any 
given embodiment of television control system 100 the signal 104 may take a variety of the 
aforementioned forms. For example, television control system 100 may be coupled to receive 
inputs from a digital satellite system, the inputs being digitally encoded. The television control 

25 system 1 00 may also be coupled to receive inputs from a Community Antenna Television 

System (CATV) in which the signals are either encoded in analog or digital form. The television 
control system 100 may also be coupled to receive analog or digital signals from a conventional 
home antenna. 

The attribute information 107 may be transmitted to the television control system 100 

30 contemporaneously with the television program 105 in a variety of ways including industry 

standards, such as DVB-SI (Digital Video Broadcasting-Service Information) as defined by the 

European Telecommunication Standards Institute (ETS), or the ATSC digital television standard 

as defined by the Advanced Television System Committee (ATSC). By way of example, in the 

DVB-SI protocol, programming for the next six hours is transmitted every eight seconds for 
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each channel.. As a further example, program information for the next seven days is available 
from the interactive on-screen TV program guide available from StarSight Telecast, Inc. 
Programming information further into the future, such as for the next seven days, may also be 
obtained in other ways. For example, by receiving the information in a time-multiplexed manner 
5 over a particular channel. Such information can easily be transmitted when the user is 

performing an action that does not require a moving video image on the screen, such as when 
the user has a control menu displayed on the screen. 

Alternatively, television control system 100 can download the attribute information 107 
separately from the television program 105 by way of a separate communication session via a 
1 0 modem or the Vertical Blanking Intervals (VBI) contained in television signals. Such separate 
communication sessions include data download mechanisms supported by the MPEG-2, DVB- 
SI and DSS protocols. 

The attribute information 107 can take a form under the DVB-SI protocol as shown 

below: 
1 5 event_id, 

start_time. 

duration, 

DESCRIPTORl, 

DESCRIPTOR2, 
20 ... 

DESCRIPTORS 

The event-id field Is a unique alpha-numeric code assigned to a program. DESCRIPTORS can 
be "Short Event Descriptors," "Extended Event Descriptors" or "Content Descriptors" which 
include the following information: 
25 Short Event Descriptor: 

{ 

event_name-Iength 
event_name, 
event_description-length 
30 event_description 
} 

Extended-even 

{ 

ITEMl 

- 10- 



BNSDOCIO <WO 0117250A1_IA> 



WO 01/017250 



PCT/US00/23868 



ITEM2 
ITEMn. 

5 } ' 

content descriptor: 

{ 

CONTENT 1, 
CONTENT2, 

10 

CONTENTn. 

} 

ITEMs include the following information: 
15 { 

item_description_length, 
item_description, 
itemvaluelength, 
item_value 

20 } 

An example of item descriptions can be "Director" and item value can be "Martin 
Scorsese". CONTENT includes the following information: 

{ 

DVB-SI defined theme, 

25 DVB-SI defined sub-theme, programmer defined theme, programmer defined 

subtheme, 

} 

An example of theme and subtheme are MOVIE and COMEDY, respectively. The 
programmer defined theme and sub-theme are values that may be provided by the EPG Data 
30 provider. 

Category- value pairs 1 15 are generated from the above type of information. The 

category-value pairs 1 15 take the following format: Category Name - Category Value, where 

category name cart be "Title", "Director", "Theme", "Program Type" etc. and category values 

can be "Seinfield" "Martin Scorcese", "Comedy", "Sitcom" etc. Generation of category- value 
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pairs 1 15 from attribute information 107 allows generation by preference agent 110 of categories 
that are not explicitly present in the attribute information 107. For example, category-value pads 
1 15 can be: Title-49ers, Description-football, and Description Search Rule-football(AND) San 
Francisco. Thus; preference agent 1 10 is capable of generating category- value j>airs 1 15 from 
5 attribute information 107 even where there is no field in the attribute information that 
corresponds to the created category-value pair. 

Preference database 1 16 is preferably generated initially by downloading category-value 
pairs from a third-party source such as StarSight Telecast, Inc. Advantageously, such sources 
may provide information customized for particular geographical areas and dates. For example, 

10 the database may contain data that gives sporting events involving local teams higher ratings 

than other sporting events. In addition, seasonal or holiday programs may be indicated as being 
preferred during particular seasons or holidays. For example, programs involving summertime 
activities would be indicated as having higher weighting during the summer than at other times 
of the year. The preference database is modified as described herein in accordance with the 

15 user's viewing habits. In addition, the preference database can be periodically updated from 
third-party sources to reflect the aforementioned seasonal or holiday updates. 

Categories in the preference database 1 16 are either predefined, such as those received 
from third-party sources, or are dynamically created from attribute information 107 received for 
programs 105. Categories, and associated values, that are dynamically created are preferably 

20 given a default rating by preference database 116. An example of the preference information 
created by preference agent 1 10 or downloaded to preference agent 1 10 is shown below. In the 
following example, the three columns of numbers in the category statistics and value statistics 
portions indicate weighting (in a range of 0 to 1000) duration watched (in seconds) and amount 
of time that programs matching that particular category or value was available (in seconds). The 

25 information is preferably stored in the form of database records. 



Categories: 

channel 1000 

title 1001 

title-Substring 1002 

30 genrelnfo 1003 

description 1 004 

descSubString 1005 

episodeName 1007 

type 1008 

35 stars 1009 

director 1010 

YearProduced 1011 
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MPAARATING 1012 
criticRating 1013 
Values: 

Titanic 2000 

5 Ami 2001 
3rd Rock From the Suri 2002 
The Gods Must Be Crazy 2003 

Seinfeld 2004 

Headline News 2005 

10 Bugs & Daffy 2006 

News 2007 

004 2008 

005 2009 
063 2010 

15 49ers 2011 

SITCOM 2012 

COMEDY 2013 

MOVIE 2014 

NEWS 2015 

20 Sanfrancisco 49ers 2016 
A Coke bottle raises 
havoc for a tribe of 

African bushmen 20 1 7 

John Mayers 2018 

25 Lousie Barnett 2019 

Man us Weyers 2020 

Sandra Prinsloo 202 1 

Jeff Bridges 2022 

Valerie Perrine 2023 

30 Phil Hartman 2024 

Jamie Uys 2025 

Lamont Johnson 2026 

1981 2027 

1973 2028 

35 1996 2029 

THREESTAR 2030 

TWOSTAR ; 2031 

NUDITY 2032 

VIOLENCE 2033 

40 ADULTSITUATIONS 2034 

ATULTLANGUAGE 2035 



Category - Value pairs: 





1001 


2001 




1001 


2002 


45 


1001 


2003 




1001 


2004 




1001 


2005 




1001 


2008 




1000 


2009 


50 


1000 


2010 
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1002 2011 






1003 2012 






1003 2013 






1003 2014 






Category statistics: 






1001 




1000 31104 4022280 


1002 




1000 31104 4022280 


1003 




1000 31104 2613384 


1004 




1000 20304 1996596 


1005 




1000 20304 1996596 


1006 




1000 5238 1259028 


1007 




1000 3438 369450 


1008 




1000 13266 812970 


Value Statistics 






2001 . 


1000 


1638 88074 


2002 


1000 


6714 178560 


2003 


1000 


6552 387054 


2004 


1000 


5400 165600 


2005 


1000 


1600 9000 


2006 


1000 


3600 28800 


2011 


500 


1800 10800 



In the above example, fourteen categories are provided (1000 - 1013) followed by thirty- 
six values. The correspondence between the categories and values (category - value pairs) is 
25 next shown. Data for the categories and then the values is shown next. This data is organized in 
three columns as described above. 

In the embodiment shown in Figure 1 and described above, the ratings for categories and 

values are dynamically generated by the preference agent 110 instead of being stored in 

preference database 116. In an alternative embodiment, the ratings may be stored in preference 

30 database together with the category- value pairs. 

« 

Referring now to Figure 2, preference determination is used to predict a user's 
preferences in the choice of TV programs. This prediction is based on, (i) analysis of the 
individual users viewing habits, (ii) analysis of the viewing habits of a representative sample of 
users and (iii) EPG data for the television programs available during the period of the collected 
35 sample. 

The viewing habits of a user are described below in the context of actual programs 
watched by the user. However, other information may be just as useful for the purposes of the 
invention, and includes information regarding programs that are never watched by the user or 
watched only very briefly before changing to another channel, programs that the user schedules 
40 to record, programs that the user records but does not watch or watches only briefly, and 

programs for which the user requested information from a program guide (e.g. an on-screen EPG 
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menu) or for which the user never requests information or for which the user requests 
information but does not choose to watch. 

One method for creating a preference determination is through the use of a Bayesian 
network that is used to predict user behavior, and hence a preference agent, is a knowledge 
5 based approach. In this method a knowledge engineer interviews an expert about the Field of 
expertise of the expert. The expert and the knowledge engineer determine the distinction of the 
worlds that are important for decision making in the field of the expert. The knowledge engineer 
and the expert next determine the dependencies among the variables and the probability 
distributions that quantify the strengths of the dependencies. 

10 A second method is Bayesian network is a data based approach. In this second method 

the knowledge engineer and the expert first determine the variables of the domain. Data is then 
accumulated for those variables, and an algorithm applied that creates the belief network to 
predict user behaviors from this data. The accumulated data comes from real world instances of 
the domain. This second approach exists for domains containing only discrete variables. 

1 5 Bayesian methods can be performed by utilizing an algorithm known as the "EM" 

algorithm, which, as recognized by those skilled in the art, calculates the maximum a posteriori 
values ("MAP values") for the parameters of the model to predict user behavior. The EM 
algorithm is described in Dempster. Maximum Likelihood From Incomplete Data Via The EM 
Algorithm, Journal of the Royal Statistical Society B, Volume 39 (1977), incorporated herein by 

20 reference. 

After calculating the probability for each variable, each variable within the belief 
network is then scored for goodness in predicting the preferences of the user by rendering a 
subscore. As recognized by those skilled in the art, the subscores are generated by using the 
marginal likelihood of the expected data at the maximum a posteriori values. 

25 , One skilled in the art will recognize that any standard model to predict user behavior 

inference algorithm can be used to perform this step, such as the one described in Jensen, 
Lauritzen and Olesen, Bayesian Updating in Recursive Graphical Models by Local 
Computations, Technical Report R-89-15, Institute of Electronic Systems, Aalborg University, 
Denmark, incorporated herein by reference. 

30 Television viewing habits of the population are sampled to generate representative 

samples of viewer behaviors, 1 17. This task is typically performed by companies such as 

Nielsen Media Research, who collects user behavioral samples to conduct market research. The 

results of the sampling is stored in a viewer behavior database. The format of the viewer 

behavior database is proprietary to the company conducting the sampling. Viewer behavior 
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5 



database contains demographic information about the people in households participating in the 
sampling. This information includes but is not limited to race, age, annual income and gender. 
Viewer behavior database also contains information about all the television programs each 
viewer watched* during the period of sampling. 

Program Information and the schedule of television programs is available in the EPG 
database, 104. Program Information for a television program contains information about various 
attributes of the program which includes but not limited to the title, program type and program 
category of the television program, arid also the actors acting in the television program. 

Figure 3 lists two examples of program information, example 1, 124, describes the 
1 0 program information for a audio visual television program, and example 2, 125 describes 
program information for a graphical text based television program. 

Viewer behavioral database is analyzed 1 18 to identify traits of TV programs and 
determine how such identified traits influence viewing habits in the representative sample of 
users. These traits and their influence amongst the viewing population are then used in aiding 
1 5 the process of predicting an individual user's choice in TV programs. Figure 4, 126, lists some 
examples of such traits. Some of these traits are derived from program information, 104. Some 
of these traits are derived by looking for users liking for association of multiple traits or 
association of traits with other viewing parameters which includes but is not limited to time, day 
of the week, holiday, working day and weather season. Some of these traits are derived by data 
20 analysis of Viewer behavioral database. Any given program would exhibit varying degrees of 

the above identified traits. This is expressed as the trait-ness of the trait in the program, e.g. trait- 
ness of comedy in Seinfeld may be 1 .2 and trait-ness of comedy in Mad About You may be .79. 
The trait-ness of a trait in a program is computed as a part of the data analysis, 118. 

The present invention also provides for a methodology for determining viewer 
preferences. In one embodiment, an individual viewer picks one show to watch out of the 
collection of available program by evaluating a stochastic liking function for each program and 
choosing the program with the highest score. The liking function is modeled as an aggregate of 
liking for a specific trait and the degree to which that trait is exhibited by that program. The 
liking function can be computed as: 
30 l(p) = ZX(tn)* tn 

Where 

I(p) represents the liking for program p 

X(tn) represents the liking for a specific trait tn 

tn represents the degree to which the Trait tn is exhibited by p 
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When a viewer watches television, it is assumed that the viewer chooses a program for 
which the value of l(p) is the highest. For any given viewer, the better the values of \(tn) 
represent its actual liking, the more the accuracy in determining its preference for TV programs. 
Thus for any given user if all the relevant traits are known along with the exact liking of each of 
5 those traits, the above Liking Function would always accurately predict the liking of any given 
program. 

However, identification of all the relevant traits and the determination of the exact liking 
of each viewer is a non trivial task and arrived at iteratively by a process of regression analysis. 
While a majority of the traits relevant to a specific user may be derived directly from the 

10 EPG information itself, some of the traits are discovered by analyzing the user's viewing habits 
over time. Traits that are suitable for discovery by a process of regression analysis are generally 
hidden or associated traits. 

Hidden traits are those which influence a user's viewing habits but which can not be 
derived from the EPG information. For example, sitcoms featuring a specific ethnic background 

1 5 would resonate more with viewers of that ethnic background. Another example of the effect of a 
hidden trait could be the strong liking of the sitcom "Frasier" amongst the set of users who have 
a strong liking for the sitcom "Cheers." Assuming that the names of actors performing in a 
sitcom are not be available in the EPG data, this affinity towards both the sitcoms by the same 
set of people may be explained by the presence of some trait commonly exhibited by both the 

20 sitcoms, namely the presence of one of the central characters in both the sitcoms. Such hidden 
preference criteria would have to be captured by adding the hidden traits in the computation of 
the Liking Function. 

Associated traits - Traits which have a different influence on a user's viewing habits 
when combined with other traits. For example a user would have a certain liking for any given 
25 / Seinfeld episode, and a certain liking for any premiere sitcom being aired for the first time. 
However, its liking for a premiere episode of Seinfeld may be sufficiently large enough to 
require an additional trait, "new Seinfeld", to fully explain its liking for a premiere episode of 
Seinfeld. 

The liking of each trait for a given user has to be similarly refined from initial 

30 approximations by regression analysis. Examples of traits and the associated liking for a sample 

user include but are not limited to those listed in Figure 4. The trait "NBC <=> NEWS" is an 

example of an associated trait where the traits being associated are "Channel NBC" & "Program 

Type News." Users liking for NEWS programs may be 1 & the preference for the "NBC 

Channel" may be 2 whereas its preference for NEWS on NBC may be 13, i.e., this user does not 
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always watch news programs or programs in general on the NBC channel, however he has a 
strong preference for NEWS programs on the NBC channel. If only its liking for the "Channel 
NBC" & "Program Type News" are considered, its preference for "News on NBC" would be 
unexptainable. 

5 This procedure to identify traits is first carried out using the viewing habits of the 

representative sample along with the determination of the distribution of likings of each trait 
within the representative sample. 

Some of the outputs of the analysis 102, of the viewing habits of a representative sample 
are a set of preference determination parameters viz. (i) the traits which are exhibited by 

10 recurring program and the degree to which such a trait is exhibited, and (ii) a distribution list of 
the likin gs amongst the viewing population of each of the above trait. 

One methodology to derive these parameters is outlined in Figures 5(a) and 5(b). 
Different phases of the flow chart are explained further in the following sections. An initial s$t.^ 
of liking values are computed for each user, 129. This initial liking value may be arrived at in 

15 various ways. One of the ways could be to base it on the amount of time for which a given tratt 
was watched as opposed to the amount of time it was available during the user's viewing period. 

Using these initial liking values for the traits the Liking Function l(p) is computed for all 
the programs that were available during the times the user watched a given program. These 
programs are then arranged in descending order, 140, of liking value for a time period during 

20 which the viewer watched a program, 14 1 . The actual program the user watched at a given time 
is highlighted, 143. If all the highlighted programs in Figure 6 had the highest score, the Liking 
Function correctly identified the user's preference. If the program watched by the user ranks 
below some other competing program available at that time the liking values used do not 
correctly reflect the user preference. By way of example, but not by way of limitation, if there 

25 are N programs ranked above the program actually watched, the error in the Liking Function for 
this program may be called N, 142. Such values of N are determined for each program watched 
by the user by comparing it with the Liking Function of the other competing programs. In the 
example illustrated in Figure 6 values of N are computed for the times when the user watched 
TV between the hours of 10:00-10:30, 10:30-11:00, 1 1:00-1 1:30. Using regression analysis, the 

30 liking values of each of the traits are adjusted incrementally to reduce the average value of N. 

The set of liking values which yield the lowest value of the average of N are considered the best 
set of liking values for that user. 

One methodology of the regression process is illustrated in Figure 7. At the beginning of 

the process liking values for traits a, b, c, are Xal , Xbl, Xcl B . ..(144,145). Xal represents the 
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liking of trait, "a," Xbl represents the liking of trait "b M and so on. The Liking Function of all the 
relevant programs are computed using this initial set of liking values. The liking value \al of the 
first trait V is changed incrementally to determine the average value of N. The value X.a2, 147, 
which yields the lowest average value of N, is taken as the new liking value for the trait "a." 
5 This new value liking value Xa2, 148, now replaces the old value and is now used to determine 
the remaining X values. The above set of liking values computed would predict the users 
preferences with a high degree of accuracy if set of the traits considered in the Liking Function 
includes all the traits that are relevant to a user. The average value of N not converging (to 0 or 
some other acceptable value) would indicate that not all traits that are relevant to the user have 

1 0 been considered in the computation of the Liking Function. Introduction of additional traits, 
either associated or hidden, is used to improve the determination of the user's preference. 
A determination of associated traits is achieved by a variety of different methods. 
One method is through the application of heuristic rules of thumb where the associative 
value of a number of traits is not reflected in the program information obtained from an EPG but 

15 is relevant to human viewing habits. For example, a user who has a liking for Seinfeld would 
most probably have a much higher preference for a premiere or new episode of Seinfeld. Such 
heuristic rules of thumb regarding the associative value of a number of traits may be passed to 
each set top box via the Head End. Another method of determining associated traits is with an 
algorithmic search which looks for common traits in programs and introduces new associated 

20 traits to try to improve the Liking Function for a user. 

Determination of hidden traits in different program s can be achieved in a number of 
ways. One of these methods is illustrated in Figure 8. The possibility of the presence of a hidden 
traits exists whenever there exists a strong co-relation between any two traits present in different 
programs. The y-axis represents an increasing liking value for liigher values of y. Every point on 

25 the x-axis in Figure 8 represents a user arranged such that the liking value of the trait "to" 
increases with higher values of x. If for the same values on the x-axis the liking values of a 
strongly co-related trait n tm n are also plotted, it will exhibit a relation to the liking graph of the 
trait "tn". The co-relation between any two traits may be explained by he presence of a common 
hidden trait. Thus traits "tm" and "tn M may be expressed mathematically as 

30 tm = tx + tm f 

tn = Ctx + tn f 

where C is some constant indicating the amount of co-relation between traits "tm" & "tn". 

Hidden traits can also identified by applying rules of thumb or some other appropriate 
manner. 
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While ail programs of a specific genre exhibit some common traits, the degree to which 
these traits are exhibited vary from program to program. This degree of traitness for recurring 
programs can be quantified such that it best explains the viewing choice of watching or not 
watching that program in the representative sample. For example consider a user who has a 
5 certain liking □ for comedy. The users' decision to watch a comedy program would be 
influenced by whether the amount of comediness exhibited by that program crosses some 
threshold of comediness that is determined by its liking for comedy. | 

One method to determine the traitness of a trait T in a given program N is illustrated in 
Figures 9(a)-(c). The highlighted program represents the programs actually watched 156 by a 

1 0 user. All available programs are arranged in decreasing order of rank 154 as computed by the 
Liking Function. In the case of User 1 155 the Program N 156 which was watched by him 
appears third in rank whereas the Liking Function should have actually ranked Program N at the 
top. In this case the margin of error 157 in the traitness of trait T is 3. In the case of User 4 the 
program N watched by the user was ranked the highest. The margin of error in the traitness oF 

1 5 trait T is 0. In the case of user 2 where he did not watch Program N, it was ranked the highest " 
The margin of error in this case can be considered to be a constant K. A suggested value of K 
may be the number of programs available to the user at that time. Similarly for user 3, the 
margin of error may be considered K-l. Such margins of error are computed for all users who 
watched the Program N and the average margin of error is computed. 

20 Using regression analysis, the traitness value of the trait T is adjusted incrementally to 

reduce the average value of the margin of error. The value of traitness which yields the lowest 
value of the average of the margin of errors is considered the best value of the traitness of the 
trait T exhibited by the program N. 

Traitness values may also be assigned by rules of thumb or some other appropriate 

25 manner. 

One of the outputs of analyzing the viewing habits of a representative sample is a 
distribution list of the likings amongst the viewing population of each trait. This information is 
presented schematically in the Figure 10 and is made available to every individual user Set Top 
Box along with the broadcast TV programs and program information. "Number of traits", 164, . 
30 represents the total number of traits that have been identified after analyzing the viewing habits 
of the representative sample. The distribution of liking of each trait is provided in a M Trait 
Record", 165. Information included in a trait record include the name of the trait, 167, the type 
of the trait, 168,( whether hidden, associate, etc), the liking distribution of the trait 170, and the 
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distribution.parameters, 172, of the trait. Examples of various possible values in a trait record 

are provided in Figure 11. 

Figure 12 illustrates one schematic representation of traits present in recurring programs. 

This information is passed along with the above liking distribution of traits. The "Number of 
5 programs" 1 73 represents the number of programs for which trait information is sent. Relevant 

traits and the degree to which they are exhibited are included in the "Program Record" 174. The 

"Program Identification" 177 information is used to identify the TV program to which this 

record pertains. An example of a Program record for a Seinfeld Episode is given in Figures 13. 

As given in the example certain traits such as "Program Name - Seinfeld", "Channel -r NBC", 
1 0 "Program Type - Sitcom" can be derived directly from the EPG and are called static traits. 

Additional traits such as "comedy" which store the degree of comediness associated with a 

Seinfeld Program may also be passed as a part of the EPG data. 

Referring again to Figure 2, the program content and the program information (in the T 

form of an EPG) are transmitted from a Broadcast Head End along with preference 
1 5 determination parameters, 1 19. Examples of preference determination parameters include but.?ure 

not limited to (i) the traits which are exhibited by each program and the degree to which such a 

trait is exhibited and (ii) a distribution list of the liking values amongst the viewing population 

of each trait. 

This information is received in each household by a program selection device 100, that 
20 include a preference Determination Module, a personal preference database, 1 16, a storage 
device and a display device 108. The personal preference database 1 16, is used to store the 
results of the analysis of the individual user ! s viewing habits. The storage device stores programs 
as per the user's specific requests or programs recommended by the preference determination 
• module. 

25 Program Selection Device 100 monitors each user's viewing actions and selection of TV 

programs being watched. This is stored in a storage device or in memory in the form of selection 
history data 189 (see Figure 15). The schematic representation of selection history data is given 
in Figure 14. The "Number of selection records", 180 represents the number of user selection 
choices stored in the selection history data. Each selection record 181, contains the information 

30 on the actual programs watched 185 along with information on the competing programs 

available at that time 1 86. Storing program information is required as the EPG may not be able 

to provide information on past programs. Information on these program may be obtained 

directly from the EPG data when the information is obtained while the program is still current. 

The rime 1 83 and duration 182 for which a program was watched also form a part of the 
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selection recoxxL As illustrated in Figure 15, a uses selection history 189 is derived from each 
choice 187 made by the user along with program information from the EPG 104. 

One; process of learning the user preferences of a specific individual is illustrated in 
Figure 16. The liking distribution in the representative sample for traits identified by data 
5 analysis, 1 1 8, are used by the Liking Function to minimize the effect of error introduced because 
of lack of sufficient sampling in the computation of the liking of the identified traits by an 
individual. An individual user's response to setup question 190 may also be factored in 
determining initial values of liking for that individual 192. 

A user selection history 189 is maintained for a fixed number of hours. The number of 

1 0 hours for which the user selection history is maintained can be preferably changed to have an 
increased rate of learning during initial days of a new user using the device. Average Error is 
computed N 194, in a similar manner as for users; in the representative sample, as previously 
described in Figure 6. If the average error is greater than a tolerable limit 195, new liking values 
are computed 198. Entries in the user selection history are moved to the Past selection historyTf 

15 the Average error remains under a tolerable limit the liking values are computed only after a " 
predefined number of hours 196. 

Another embodiment of the selection record stores a program ID instead of the entire 
program information. In this embodiment, the actual program information is broadcast at 
predetermined times from the Head End where each program is identified by the same program 

20 ID stored in the selection record. In this embodiment, the computing 198 of liking values for 
traits are performed only at this predetermined time. 

Referring now to Figure 17(a), after each computation of liking values, entries in the user 
selection history is moved to the Past selection history. Selection history is partitioned as user 
selection history 189 and Past selection history 216 to optimally store the section history 

25 records. Selection history contains information about all selections made by the user between 
two computations of Liking values. Past selection history maintains the most relevant 
information about the most relevant selections made by the user in past. The number of records 
in the Past selection history can be advantageously configured to suite the memory available for 
storage of such records. Relevancy in this context is the importance of the record in computing 

30 the liking values and is dependant on many parameter including but not limited to the age of the 

record 200, the repetition rate 203 of traits contained in programs 204 in the selection record. 

The relationship between relevancy and age 205 of the record is shown in Figure 17(b). The 

most relevant records are the most recent records. The relationship between relevancy and the 

repetition rate of traits contained in programs in the section record is shown in Figure 17(c). As 

-22- 



BNSDOCID: <WO 011725QA1_IA> 



WO 01/017250 PCT/US00/23868 

the repetition rate decreases, the relevancy increase 206, till certain limit after which the 
relevancy decreases 207. A good example to illustrate the reasoning behind this relationship 
would be relevancy of keeping records on user selection when Seinfeld a weekly program is 
available and the relevancy of keeping records on user selection when Olympics a 4 yearly 
5 program is available. While it is important to keep records about Seinfeld to compute likings of 
various traits contained in Seinfeld, it is pointless to keep records about viewer's liking for the 
last Olympics. Consider a user who does not have a liking for movies except on Fridays. To 
accurately determine the user likings of trait of "Friday night-ness" of a movie, the selection 
records pertaining to its viewing actions on a Friday have to be maintained for at least a number 
1 0 of weeks. However if the repetitive nature is too far apart in time, the relevancy decreases 
sharply- The "Relevancy versus Repetitive Rate of a Trait" graph in Figure 17(b) measures 
increasing values of relevancy for higher values on the y-axis. The x-axis measure decreasing 
rates of repetitiveness of a trait in a program. As shown in Figure 17(a), beyond a certain . 
threshold of repetitive rate, the relevance decreases sharply. 
\ 5 The process of updating of past history is shown in Figures 1 8(a) and 1 8(b). The 

processing is done for every record in the user selection history. If there is enough memory to 
store the new record in the past selection history 2 1 8, the record is removed from the user 
selection history and added to the past selection history 222. If there is not enough space in the 
past selection history 218, the records in the past selection history are sorted based on relevancy 
20 220, and :he least relevant record is deleted 221 to make space to add the new entry 222. 

In another embodiment of the present invention, the number of available programs 1 86 
stored in the past selection history can be limited to an optimal number to make best use of the 
memory available (see Figure 18). In this embodiment only the programs with higher values of 
liking, representing the traits which have significant liking values, are stored in past selection 
25 history record. 

Computing of Liking values for different traits performed by the Program selection 
device 100 is shown in Figure 19. This process is performed 198 during the learning process as 
explained by Figure 16. Regression analysis is performed to minimize error in prediction of 
viewer behavior stored in user selection history and past selection history by using current liking 
30 values as the starting point. The regression analysis process 223 is the same as the one 

performed in representative sample 131. If there are any selection record for which the error is 
more than a predefined threshold 224, the selection record is examined for the presence of 
possible associated traits 225. The rules which defines the presence of possible associated traits 

may be rules of thumb. Possible association of traits can also be discovered by looking up the 

-23- 



BNSDOCID: <WO_ 



011725QA1_IA> 



WO 01/017250 



PCT/US00/23868 



list of associated traits which were discovered from the representative sample. Liking for 
possible associated traits are computed by regression analysis 226. If the computed liking is a 
significant value 231, the liking for trait is entered 230 into the preference;database 1 16. Other 
parameters like the repetition rate for the trait also is entered into the database 230. The 
5 repetition rate is computed by looking at the repetition rate of the trait in user history and past 
history. If there are no previous selection record with this trait, then the repetition rate is 
assumed as a predefined value. As new selection records are created, the repetition rate is 
updated. At the end of the process (illustrated in Figure 19) of computing the liking values, a 
weighted average of the current liking values and the old liking values is computed. This forms 

1 0 the new set of liking values. Learning rate can be increased or decreased by changing the weight 
of the current liking values. 

A flow chart of computing scores for programs for future prediction is illustrated in 
Figure 20. As described earlier Liking 23,6 .for a program 232 is computed by evaluating the : 
function l(p) = S X(tn) * tn. X(tn) is computed as a weighted average of liking for the trait tn for 

1 5 the user 234 and the liking for the trait tn for the general population 235. 

In another embodiment of the present invention which is suited to function on program 
selection devices operating on households with multiple users, the viewer is asked to input the 
number of viewers in a household during the initial setup process. Each viewer can answer a set 
of setup questions which would capture their liking for representative traits. An initial set of 

20 possible liking values are created for each viewer. Learning process updates each viewer's 
preference database separately after identifying each viewer session correctly. Viewer 
identification is done either by explicit input from the viewer identifying the viewer or 
automatically by identifying the viewer by monitoring the television viewing behavior of the 
viewer. To identify a viewer automatically, the error value between predicted and actual is 

25 computed for all viewers. The viewer with liking value which yields the least error value is 

chosen as the possible viewer. The certainty of this decision is expressed as a probability which 
is computed as a function of the differences between error values for different viewers in this 
household. 

The objective of the present invention is to determine demographic characteristics of a 
30 user by analyzing the users viewing habits in juxtaposition with viewing habits of a 
representative sample of users whose demographic characteristics are known. These 
demographic characteristics of a user collectively constitute a demographic profile. Upon 
successful creation of an accurate demographic profile for a user, the present invention can 
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receive a collection of possible ads and show individual users only those which are targeted to a 
matching profile. 

Some example of demographic characteristics include, but are not-limited to, a users 
gender, race, age and income. The output of the analysis of viewing habits of the representative 
5 sample provides a basis for determining demographic characteristics of the individual user. 

The TV program viewing preference information chosen in the analysis of viewing 
habits, of the representative sample, plays an important role in determination of a demographic 
characteristic. Different TV programs might be required to determine demographic 
characteristics. Typically TV programs where a majority of the viewing audience shares a 
10 common demographic characteristic, in a proportion which is different from what is observed in 
the general population, are best suited in the discovery of demographic characteristics. For 
example to determine the "income' 1 demographic characteristic consider graph (i) in Figure 21a. 
The x-axis represents the annual salary in increasing order of x. For any point in the x-axis, the 
y-axis represents the probability of finding an annual salary of x in the representative sample 
1 5 (which is the same in the general population). Now consider graph (ii), where the dotted curve " 
plots the same income distribution represented in the graph (i), the first solid curve (the leftmost 
curve) shows the income probability distribution of the viewing audience for program PI in the 
representative sample and the second solid curve represents the income probability distribution 
of the viewing audience for program P2 in the representative sample. Analysis of the viewing 
20 habits of an individual user for the above programs PI & P2 could be used to determine his most 
probable annual income. Using Bayesian inference this probability may be expressed by the 
following mathematical formula 

P(Ixy|Wpl)=P(Wpl|Ixy)P(Ixy)/P(Wpl) 
where 

25 - Ixy represents an annual income range of x-y 

-* Wp 1 represents watching program pi 

- P(Ixy|Wpl) represents probability of income of Ixy given the fact that program 
pi was watched 

- P(Wpl |Ixy) represents probability of watching program pi given the fact that • 
30 the annual income is Ixy 

- P(Ixy) represents probability of income of Ixy in the representative sample 

- P( w PO represents probability of watching program pi in the representative 
sample 
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Thus for a user who watches program pi, the probability of his income being within a 
certain range can be determined. 

Programs whose viewing audience display a demographic characteristic in the same 
proportion as is observed in the representative sample (which is the same in the general 
5 population) do not contribute significantly in determining demographic characteristics. Consider 
the graphs in Figure 21b. Graph (i) shows a bar graph which represents the probability of a 
person in the representative sample being male or female. The graph (ii) shows the probability of 
person being male or female within the viewing audience of program P3 which is very similar to 
what is observed in the general population. If Bayesian inference is used to determine the gender 

10 of a person who has watched program P3, the probability of the person being a male or a female 
would be the very close to the probability of person being a male or a female in the general 
population (which is already a known value). Nothing significant is gained by analyzing the 
viewing habits of the individual for the program P3. On the basis of the graph (iii) on the other 
hand, analyzing viewing habits for program P4 would yield additional certainty in determining 

1 5 the gender of a person than what is already known about the gender distribution in the general \ 
population. 

The determination of which programs yield high probability values in determining a 
demographic characteristic of a user may be done by applying rules of thumb through an 
algorithm which chooses the program on the basis of how much a demographic trait in it 
20 viewing audience differs from what is observed in the general population by some other means. 

Depending on the program content and the demographic characteristics of its viewing 
audience, the same television program maybe used to determine one or more traits. It is also 
possible that a completely different set of programs are required to study different demographic 
characteristics. 

25 Furthermore, the present invention contemplates analyzing other preferences of a user in 

addition to or in place of the users television viewing preferences. Such information may 
include, but is not limited to, musical preference, reading preferences, shopping preferences (a 
vers' large category that comprises, among many others, preferences in clothing, furniture, 
jewelry, decorations, appliances, electronic equipment), political and religious tendencies, etc. 

30 Such information may be obtained from any source, but in one preferred embodiment it is 

obtained from on-line sources (typically accessible as world wide web sites) that the user is a 
member of or visits frequently, such as music clubs, book clubs, other special-interest sites (e.g. 
a site devoted to a particular sport or hobby), news sites, etc. Other information descriptive of 
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the users preferences includes user subscriptions, such as to newspapers, magazines, newsletters, 
shopping catalogs, and other special interest publications. 

While it is possible from Bayesian inference to ascribe a certain preference for a specific 
show based on a users demographic characteristics, it may not l>e the only factor which explains 

5 the users choice of a particular show. For example, consider the graph (ii) in Figure 21a. It might 
be possible for a user, Ul, having income less than xl having a strong preference for program 
P2 for unknown reasons. So just using Bayesian inference to arrive at Ul's most probable annual 
income based on his viewing habits of only a single program P2 would not yield the most 
meaningful result To strengthen the belief of a Bayesian inference, the users viewing habhs 

0 have to be analyzed for a set of programs where the viewing audience of each program from that 
set displays a similar demographic characteristics. The programs of the set are so chosen that 
degree of co-relation of traits exhibited by the programs in the set are minimal. Each program 
that the user views from this set of programs adds towards strengthening the belief of the 
Bayesian probability of the user possessing that demographic characteristic. 

5 To explain the need for minimal co-relation consider the previous example. If another * 

program P3 which is very similar to P2 in content and has the same demographic traits in its 
viewing audience as P2 is available to user Ul, he would have a similar liking for P2 as P3. In 
this scenario strengthening the belief of the Bayesian probability by the user Ul watching 
program P3 is not a positive contribution. However if program P3 were very different in 

10 contents and the traits that it exhibits, the probability of U 1 choosing P3 is greatly diminished 
and keeps the belief in the Bayesian probability from erroneously rising. In this situation the 
probability of a user who actually possessed the demographic characteristic of watching both 
programs P2 &: P3 would be quite high and the strengthening of the belief would be a positive 
contribution. 

IS Thus a Belief Function for the Bayesian probability for a demographic characteristic del 

may be computed in many ways which includes but is not limited to 

BF(dcl) - MAX( bp(wl), bp(w2) .., bp(wm) )*crl + 

MAX(bp(wk), bp(wk+l) bp(wk+ra) )*cr2 + . . . + 

MAX (bp(wn), bp(wn+l) .., bp(wn+m) )*cm 
30 where 

- bp(wn) is the Bayesian probability of a user having that demographic characteristic 
given the fact that he watched program wn 
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- (wl y w2, ... wm), (wk, wk+1, wk+m), (wn, wn+l, wn+m) 

are sets of programs where any member of a set has a high degree of co-relation with another 
member of the same set, but a very low degree of co-relation with a member of another set. 

- MAX(bp(wn), bp(wn+l) .., bp(wn+m) ) represents the maximum 
5 Bayesian probability value of all possible values within that set 

- crl represents a co-relation co-efficient of the set (wl t w2, . . . wm), c2 represents co- 
relation co-efficient of the set (wk, wk+1, wk+m) and so on. Elements of the same set have 
the co-relation co-efficient. 

The Belief Function for any demographic characteristic is illustrated by the S curve in 
1 0 Figure 23a. The x-axis represents increasing values of the Belief Function for a demographic 
characteristic for increasing values of x. The y-axis represents increasing values of the 
probability of a user displaying that demographic characteristic for increasing values of y. As 
shown in Figure, the probability does noUncreasing any further for higher value of the Belief 
Function. 

1 5 Some of the output of the analysis of viewing habits of the representative sample are . , 

schematically illustrated in Figure 23b: 

Display of advertisements by a program selection device in accordance with the 
invention is done when a users computed demographic profile matches the target profile of the 
advertisements. Some of the elements contained in an advertisement are the advertisement 
20 contents (may consists of video clips, audio clips, and/or graphical or textual information) and 
the meta data which includes information on the demographic profile to which this 
advertisement is targeted. 

The meta Data contained in an advertisement is represented schematically in the Figure 
23c. Here "Number of Target Records" refers to the number of "Target Records" included in this 
25 v^j^g^^-M^^aia 11 . Each "Target Record" refers to a demographic characteristic that this 
advertisement is targeted to. The Target relationship rule" is used to determine the relationship 
rule. For example, the advertisement may be targeted towards users who satisfy any one of the 
following criteria: 

income level is between Ix & Iy 
30 - ethnic background is ebl 

The "Target relationship rule" would be used to specify the above relationship. 
The "Demographic Trait Data" determined by analysis of viewing habits of a 
representative sample with known demographic characteristics and the "Target Ad Meta Data" 

are broadcast to a Program Selection Device from a Broadcast Head-end along with program 
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content and.£PG data. This is illustrated in Figure 22. This is received by each user at each 

Program Selection Device to determine the user demographic profile. Broadcast of the 

"Demographic Trait Data" and "Target Ad Meta Data" along with the advertisement content 

may be done on a periodic basis, or may be made always available on the broadcast networks or 

5 through some other communication mechanisms on an as-needed basis. 

At the Program Selection Device the "Demographic Trait Data" and "Target Ad Meta 

Data 11 are collected and stored in a storage device such as, but not limited to, a hard disk, flash 

ROM, or main memory. Collection of this data may be done at fixed time periods or whenever 

suitable depending on the embodiment chosen. For each trait record in the "Demographic Trait 

10 Data", the Program Selection Device goes through past Selection History. Each program on the 

'Trait Value Record' 1 that is available in the past Selection History is used in the computation of 

the Belief Function. The Belief Function Distribution graph in the "Trait Value Record" is then 

used to determine the probability of the user having that demographic characteristic. If an earlier 

probability for this demographic characteristic is available a weighted average of the old 

1 5 probability and the newly determined probability is taken and stored. Each targeted 

advertisement where the "Probability satisfaction criteria" of the "Target Record" is met by 

user's demographic profile is chosen for display at the appropriate time. 

As discussed elsewhere in the disclosure, in another preferred embodiment the present 

invention enables providing other targeted content to the user in addition to advertisements. 

20 Referring again to Figure 1, television control system 100 is preferably implemented by 

way of a general purpose digital computer and associated hardware that executes stored 

programs to implement the functions shown within block 100. The exact hardware and software 

platforms on which the television control system 100 is implemented is not important and may 

take a variety of forms. For example, television control system 1 00 may be implemented in a 

25 set-top box such as may typically used by individuals in the home to receive CATV signals. 

Another implementation of television control system 100 is in the form of a personal computer 

configured with the requisite hardware and software to receive and display television signals. An 

example of a set-top box that maybe programmed in accordance with the principles described 

herein is described in the following documents by IBM Microelectronics: "Set-Top Box 

30 Solutions", Product # G522-0300-00 (Nov. 19, 1997); "Set-Top Box Reference Design Kit", 

. GK10-3098-00 (April 15, 1998); "Set-Top Box Peripheral Chip", GK1O-3098-OO, (April 15, 

1998); "Set-Top Box Solutions: Helping Customers Meet the Challenges of Convergence", 

G522-0300-00 (Nov. 19, 1997); and "The challenges of convergence for Set-Top Box 

manufacturers", G599-0302-00 (Nov. 19, 1997). An example of an Application Programming 
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Interface (API) available for set-top boxes which can serve as a platform for the embodiments 
described herein is described in M API Requirements for the Advanced Set-Top Box" published 
by OpenCable (Oct. 21, 1997). An example of an operating system incorporating functionality 
to support the embodiments described herein is available from OpenTV, Inc. and is described in 
5 the following Technical White Paper publications by OpenTV, Inc.: "OpenTVTM Operating 
Environment" and "Application Development for OpenTVTM. " An advantage of such an 
operating system is the support provided in the form of function calls to obtain attribute 
information 107 from the signals 104. Alternatively, a general purpose operating system such as 
the Windows NT operating system from Microsoft Corporation may be used in conjunction with 

1 0 additional software that provides the functions required to extract the necessary information 
from attribute information 107 and to perform other manipulation of the received signals 104 
and the stored information 105. 

Storage devices 106 may include variety of different types of storage devices. For T 
example preference database 116 may be stored in a non- volatile, random-access semiconductor 

1 5 memory. Television programs 105 and attribute information 107 may be stored on storage ■ m * 
devices having greater capacity such as a conventional magnetic, hard disk drive. In general, 
storage devices 106 are understood to encompass a variety of storage devices* The exact form of 
the storage devices 106 is not critical so long as the storage devices have the capacity and speed 
to store the necessary information. Storage devices 106 may also comprise a conventional video 

20 cassette recorder (VCR) which operates under control of system 100 to store television programs 
105 and attribute information 107 on conventional magnetic tape. 

For the purposes of the present description, the television control system 100 is 
presumed to be integrated into, or coupled to, a system including a tuner and other functions 
necessary to receive television signals and to extract the attribute information 107 from the 

25 television. signal and to perform other functions typically associated with the receipt and viewing 
of television si gnals . In certain embodiments, television control system 100 may operate in 
conjunction with a database agent that facilitates interaction with preference database 1 16 by 
causing storage and retrieval of information to or from the database in an optimal manner. The 
preference database 1 16 may be implemented by a commercially available database product 

30 such as the Oracle Light database product available from Oracle Corporation which also 

incorporates the functionality to implement the data base agent described above. 

Recording manager 1 12 causes recording of programs 105 by periodically initiating a 

sequence of steps shown in Figure 24. At 201, recording manager 1 12 sends a request to 

preference agent 1 10 for ratings of all programs at a particular time (X), or alternatively, for 
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ratings of all programs within a particular lime period (X). By way of example, the steps shown 
in figure 2 may be performed every six hours. In certain embodiments, the frequency with 
which the steps in Figure 24 are performed may be changeable by the user. Preference agent 1 10 
responds at step 202 by providing ratings, from preference database 1 16, for each program 
5 received from recording manager 1 12. Recording manager 1 12 then causes recordation of the 
programs at time X, or within time period X in accordance with the ratings received from 
preference agent HO. Specifically, programs having the highest rating are given highest 
preference for recordation and programs having the lowest rating are given lowest preference to 
recordation. The recordation is subject to storage capacity constraints. For example, if the 
1 0 highest rated program is one-hour long, and only thirty minutes of recording space is available 
on storage devices 106, then the one-hour program is skipped and the highest rated thirty-minute 
program is recorded. 

Highest priority for recording of programs is given to programs specifically requested by 
the user. For example, if the user identifies a particular program for recording, such as by 

1 5 specifying the date, time and channel, or by specifying an identification code for the program, 
recordation of that program is given priority over programs rated by the preference agent. Next 
highest priority is given to programs matching particular category-value pairs specified by the 
user. For example, if the user does not identify a particular program, but specifies that one-hour 
long documentaries pertaining to travel should be recorded, then recordation of programs 

20 matching such category-value pairs is given priority over programs rated by the preference agent 
110. In alternative embodiments, relative priority between user-specified programs, user- 
specified category-value pairs and programs rated by the preference agent 110 is changeable by 
the user. 

Recording manager 1 12 manages storage capacity on storage devices 106 by causing 

25 deletion of television programs 1 05 in accordance with ratings of such programs generated by 

preference agent 1 TO. This is performed in a manner similar to that explained above for 

determining which programs to record. Figure 25, which shows the steps taken by recording 

manager 1 12 to determine which programs to delete, is similar to Figure 24. At step 301, 

recording manager 301 requests ratings from preference agent 1 1 0 of all programs stored on 

30 storage devices 106. At step 302, preference agent 1 10 responds by providing the requested 

deletion ratings. At step 303, recording manager 112 responds by causing deletion, when 

needed, of programs in accordance with the deletion ratings received from the preference agent 

1 10. Specifically, when additional space on storage devices 106 is required to record one or 

more additional programs, recording manager 1 12 causes deletion, or overwriting of programs 
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having the lowest rating first. Thus, stored television programs which are determined by 
preference agent 1 1 0 to be least preferable, in relation to other stored television programs, are 
deleted or replaced first, and those determined to be most preferable are deleted or replaced last. 
Deletion of programs occurs only when required. Advantageously, this results in storage device 
5 106 typically being filled to maximum capacity, thus providing the user with as wide a variety of 
programs as possible. The user can specify programs that are to remain on the storage device 
106. Such programs are not deleted by the recording system 100 in the steps shown in Figure 25. 
In addition, the user can specify programs that are to be deleted, and therefore override the steps 
shown in Figure 25. 

10 In certain embodiments, the preference database is used by system 100 to alter the 

manner in which information about currently broadcast programs is presented to the user. For 
example, in such embodiments, the preference database is used to rearrange the order in which 
currently broadcast programs are presented to cause programs having attribute information 107 
rated highest by preference database 1 16 to be presented first. Alternatively, the preference 

15 database 1 16 can be used to organize information regarding the currently broadcast programs 
according to the liking of various traits stored in the preference database. 

Figures 26a and 26b illustrate alternative hardware configurations for systems employing 
the principles of the present invention. Figure 26a illustrates a hardware configuration that 
supports storage and retrieval of digitally encoded audio and video. Interface 902 is a standard 

20 digital cable or digital satellite input interface. Interface 906, which is the hardware interface to 
storage devices 106 preferably takes the form of an IDE or SCSI interface, or the proposed 
IEEE- 1394 interface. Interface 906 is an NTSC or PAL encoded video interface. If the television 
signal 104 takes the form of an analog signal, as in the case of most current television broadcast 
signals, and CATV signals, then the signal 104 must be digitized and generally compressed (for 

25 example, by the MPEG-II standard) before storage on a digital storage medium such as shown in 
Figure 26a. 

Figure 26b illustrates an embodiment using an analog storage device 106 such as a 

conventional VCR. If the television signal 104 is analog then the interface 910 takes the form of 

a conventional NTSC or PAL interface. If the television signal 1 04 is digital then the interface 

30 910 takes a form as interface 902 shown in Figure 26a and a digital-to-analog converter is 

required to convert the received signal to analog form before storage on storage device 106. 

Figure 27 illustrates operation of an automatic pause-record feature of preferred 

embodiments. If a user is watching a currently broadcast program and wishes to stop or 

temporarily pause viewing of the program, the recording system 100 advantageously allows the 
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program to be recorded so the user can resume viewing the program at a subsequent time. As 
shown in Figure 27, at 1002, the user is viewing a currently broadcast program. Generation of a 
pause input at 1004 by the user, such as by pressing of an appropriate button on a remote control 
coupled to the recording system 100, causes the system 100 to cause at 1006, recordation of the 
5 program being viewed by the user. The user is then free to watch another program or stop 

watching the television 108 altogether. At a subsequent point in time, if a resume viewing input 
is received, such as by pressing of an appropriate button on the aforementioned remote control, 
then at 1010, the recording system 100 causes the program recorded at step 1006 to be retrieved 
and shown on the television 108 from the point the recordation w as initiated at step 1006. If the 

10 program is still being broadcast when step 1010 is initiated, then recordation of the program 

continues by the system 100. The user thus can easily interrupt viewing of a currently broadcast 
program and resume subsequent viewing. 

Preferably the recording system 100 supports a variety of functions such as fast-forward, 
rewind and visual scan of stored programs, and other functions supported by the storage medium 

15 106. For example, if the storage medium 106 takes the form of a VCR then the program viewing 
and manipulation functions will be limited to the standard VCR functions of fast-forward, 
rewind, forward or reverse visual scan. If the storage device 106 takes the form of a digital 
storage medium then more advanced program search and retrieval functions can be supported. 
It will be appreciated that the fiinctions performed by the preference agent 110 and the 

20 recording manager 1 12 are illustrative of a particular embodiment. However, the division of 
tasks between the two modules 1 10 and 1 12 may be changed. In addition, the data formats 115, 
116, 105 and 107 may also take a variety of forms. 

Figure 28 illustrates an EPG and text information service in accordance with one 
embodiment of the present invention. As shown, a local cable television company's billing 

25 / vendor 10 communicates via a billing link to an RS-232 port of a system manager 12 located at 
a cable head end. Billing vendor 10 includes a subscriber database and generates a monthly bill 
for the subscribers in the system based on the level of service and any pay-per-view purchases. 
System manager 12 can be a personal computer or other processing device which receives 
viewer authorization transactions from billing vendor 10 and generates transactions for delivery 

30 to the distribution apparatus or the subscribers. Such transactions include text channel definition 
transactions which instruct the subscriber's set top box which group of channels it is entitled to 
receive, which frequency to tune for a particular text data channel, whether to mute the audio for 
that text channel, the pagination delay between pages, and the like. 
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System manager 12 also communicates via a head end link to an RS-232 port of a head 
end controller (HEC) 14 which controls the transmission of television programming to the 
subscribers. As will be described in more detail with respect to Figure 29, HEC 14 
communicates via a control link to an RS-232 port of an information services processor (or data 
5 controller) 16 which manages the flow of EPG and text data in accordance with the invention. 
As shown by dotted line in Figure 28, information services processor (ISP) 16 is preferably 
located at the cable head end with system manager 12, HEC 14 and the signal scramblers. 
However, those skilled in the art will appreciate that all of the head end equipment need not be 
located at one site. 

1 0 EPG data is supplied from one or more local or remote EPG suppliers 1 8 via a satellite 

link, modem link or other communication link to an RS-232 port of ISP 16. Similarly, text data 
from one or more text channel suppliers 20 is provided via a satellite link, modem link, or other 
communication link to another RS-232 port of ISP 16. In preferred embodiments, ISP 16 has a 
plurality of identical RS-232 ports for accepting data from a plurality of EPG suppliers 18 and 

15 text channel suppliers 20. 

Also, as shown, one of these RS-232 ports can be used for a control link to HEC 14 as 
well. ISP 16 manages EPG and text source databases in response to control signals from HEC 14 
in order to provide EPG data and/or text channel data to selected viewers. 

HEC 14 also provides control data directly to the viewer's set top box via an RS-485 

20 output port. Preferably, the control data from HEC 14 can include text channel definition 
transactions as well as EPG definition transactions for instructing the set top box at which 
frequency to tune for the EPG data and the like. The control data may also include software for 
downloading into the viewer's set top box for reprogramming the viewer's set top box as 
necessary. The control data from HEC 14 can be inserted into the vertical blanking interval of 

25 the selected cable television signal by diaisy-chained scramblers 22, 24 and 26 using known in- 
band techniques, although the control data from HEC 14 may also be modulated on an out-of- 
band carrier or an in-band audio carrier for transmission as. Preferably, scramblers 22-26 are 
daisy-chained so that the scramblers may be addressed individually or globally. 

Similarly, EPG data and text channel data from ISP 16 are provided to the viewer's 

30 television set top box via an RS-485 output port of ISP 1 6. EPG data and text channel data are 

similarly inserted into the vertical blanking intervals of selected cable television signals by EPG 

scrambler 28 and text channel scramblers 30 and 32, respectively. Scramblers 22-32 may insert 

the control data, EPG data, and text channel data into other portions of the video signals such as 

the horizontal blanking intervals or else replace the video entirely. Typically, the number of 
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scramblers depends on the number of premium channels for which scrambling is used. 
Preferably, EPG scrambler 28 and text channel scramblers 30 and 32 are identical to control data 
scramblers 22-26 and are similarly daisy-chained for individual or global addressing. As shown 
in Figure 28, scramblers 28-32 receive a single serial data channel which carries the combined 
5 EPG data and text data and display control transactions, as described in more detail with respect 
to Figure 29) for all data streams in use. Each scrambler is also equipped with memory for 
storing a predetermined amount of this data in an internal memory so as to minimize the number 
of database accesses. Preferably, scramblers 28-32 have internal memory sufficient to store a 
significant number of transactions. For example, scrambler 30 may have enough internal 

1 0 memory to score a day's sports scores for display on a sports text channel. The data received and 
stored in scramblers 28-32 is preferably in RS-485 format, and the protocol in a preferred 
embodiment is SDLC. All data transactions to scramblers 28-32 are sent on individual data 
streams specifying the target scrambler (station addresses in SDLC protocol), and the control 
data is sent on a global data stream which is filtered in the scramblers 28-32 based on the 

15 address of the scrambler so that the data streams can be configured by a transaction from ISP 16. 
The individual EPG data and text data streams are preferably generic in the scramblers so that 
they can be allocated as desired. Preferably, scramblers 28-32 have baud rates of at least 9600. 

Preferably, the subscriber's set top box is a set top box 34 which comprises an EPG 
memory 36 for storing the EPG data from ISP 16. For example, EPG memory 36 may store one 

20 or two weeks of EPG data for selective access by the viewer via a menu of the set top box 34. 
This menu preferably allows the viewer to scroll through the EPG data stored in EPG memory 
36 using the key pads of the viewer's television remote control device. Set top box 34 may also 
comprise a nonvolatile template memory 38 for storing the template in which the EPG data is to 
be inserted for display to the viewer on the viewer's television 40. In this manner, a video signal 

25 containing the template display data need not be continuously retransmitted to the set top box 
34, thereby saving more bandwidth. Instead, the EPG data only needs to be updated every 30 
minutes or when there is a program change. Of course, different set top boxes 34 may have 
varied amounts of memory and processing capabilities for such purposes in accordance with the 
acceptable memory costs during manufacture of the set top box 34. 

30 As shown in Figure 28, set top box 34 may also comprise a text data memory 42 for 

storing a page of text data for presentation to the screen. Thus, while one page of text data is 
displayed to the subscriber, the next page of text data may be loaded into the text data memory 
42. 
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ISP 16 manages the flow of text data and EPG data from the data service provider to the 
viewer's set top box 34. ISP 16 manages this data by accepting data only from one or more 
authorized text data and/or EPG data sources, processing the text data and EPG data in its 
internal database manager, and formatting the processed data into a common data transaction 
5 format for output to the scramblers for transmission to the set top box 34. Provision of EPG data 
and text data to the subscribers is controlled by the head end controller 14 via the control link. 

One example of a suitable ISP 16 is an IBM PS model 7546 personal computer with a 
plurality of RS-232 serial input ports for EPG data and/or text data inputs and at least one 
RS-485 HDLC serial link at its output of the type used by HEC 14. As shown in Figure 28, the 
10 control link can be a single RS-232 serial port. The hardware arid software components of ISP 
1 6 are then configured as illustrated in Figure 29. 

One embodiment of ISP 16 is illustrated in Figure 29 as a plurality of RS-232 ports 
which provide a common interface for the EPG data and text channel data asynchronously 
provided by the EPG suppliers) 18 and text channel suppliers 20. The EPG data and text 
1 5 channel data is transmitted to ISP 16 via a satellite link (when the interface is operated in 

simplex mode) or by modem (when the interface is operated in half duplex mode). Preferably, 
the data is transmitted at a baud rate of at least 1200. 

ISP 16 functions as a "gate keeper" which only allows access by authorized data sources. 
Accordingly, when ISP 16 receives a message from an EPG supplier 18 or a text channel 
20 supplier _0, it first checks the data for authorization. If that supplier is not authorized, the data is 
ignored. When the supplier is authorized to access ISP 16, ISP 16 performs the requested action 
and returns a command response message. If the communications link is simplex, the response is 
ignored. In this manner, access to ISP 16 can be limited by authorization codes, but as will be 
described below, access is also limited by whether the data provider provides EPG data or text 
25 data in the transmission protocol expected by ISP 16. 

Messages sent between an EPG supplier 18 or a text channel supplier 20 and ISP 16 can 
be formatted to include a start of text byte, a data block of ASCII characters, checksum bytes 
and an ASCII carriage return. This format is used in commands sent to ISP 16 from the data 
suppliers as well as in responses sent to the data suppliers. The checksum can be a two byte 
30 CRC of all bytes in the message field beginning with the first character following the start of 
text character up to but not including the checksum field. The checksum is transmitted in the 
message as the hexadecimal ASCII representation (four bytes) of the CRC computation. The 
data blocks can be configured differently depending upon whether the input data is EPG data or 
text data. 
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EPG data from the EPG supplier 18 can be formatted in accordance with an EPG 
command set including, for example, a Define Program Command which is used to identify all 
data relating to a single program, a Define Category Command which is used to establish a 
category for identifying different types of programs, and a Delete Category Command which is 
5 used to delete an unused category to make room in the database of ISP 16 for new programming 
categories. The EPG data may be formatted on a per program basis by these commands. 

Delimiter characters can be used for variable length fields such as the title and program 
description blocks to identify the length of the field. For example, a NUL (0 hexadecimal) 
means the field is null, SOH (I hexadecimal) means the field is valid, and ETX (31 hexadecimal) 
1 0 means the end of the current record. 

Once data transmitted with a Define Program Command is stored in an EPG database of 
ISP 16, the EPG data is formatted into transactions for transmission to the set top box 34. This 
command may also be used to update a program definition since it will overwrite a 
corresponding entry in the EPG database of ISP 16. 
15 ISP 1 6 may respond to such commands from the EPG supplier 1 8 by sending an 

» 

appropriate response such as: no error (normal response), service provider not found (not 
authorized), type of service not found (not authorized), category ID riot found, unrecognized 
command, checksum error, insufficient disk space, and the like. Other EPG commands and 
command responses may be provided as desired. 

20 The text channel data can originate from many different text channel suppliers 20 and 

arrive at the ISP 16 through different communications links such as satellite, dial up modem, 
direct connect modem or with a direct connect to the system manager 12. 

ISP 16 can include a plurality of databases and database managers. As shown in Figure 
29, there may be two types of databases maintained in ISP 16-—one type for EPG data and one 

25 for text channel data. The EPG database is designed to collect data from each EPG supplier and 
to sort each EPG program record by channel and time of day. A separate database is created for 
each text channel for collecting text data from the associated text channel supplier 20 and 
formatting the received text data for transmission on individual text channels using the 
techniques to be described below. Each database that is created is identified by the service 

30 provider and type of service codes listed in the Define Program Command for use in the control 

link commands provided to ISP 16 from HEC 14. 

As shown in Figure 29, a received command is checked for its command code, the 

service provider, type of service and authorization code, as appropriate, by router and formatter 

43. If the command is from an unauthorized data source, the subsequent data is ignored. 
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However, if the received data is from an authorized supplier, router and formatter 43 routes the 
data to the appropriate database within ISP 16. For example, if EPG data is received, it is routed 
via EPG database manager 44 to EPG database 46. 

In one embodiment, EPG database manager 44 sorts the received EPG data by channel 
5 and time of day and stores the received EPG data in the appropriate location in EPG database 46 
for later recall. EPG database manager 44 may also perform garbage collection on the EPG 
database 46 as records are deleted. EPG database manager 44 may also call a data compression 
software routine such as the Huffman Compression Algorithm which, as known to those skilled 
in the art, maps more frequently used characters to fewer bits than the usual eight bits used in 

1 0 normal ASCII, while giving the less frequently used characters more bits. The number of bits 
used for a character is based on its probability of appearing in the data stream. Huffman 
encoding is described in detail in an article entitled "Lossless Data Compression", Byte, March, 
1991, pp. 309-314, incorporated herein by reference. Such a routine is desired to maximize 
storage efficiency at EPG database 46. Similarly, each text database manager can store the text 

1 5 information in the associated text database and performs data compression. 

* 

Router and formatter 43 and database managers 44, 48 and 52 are all controlled by 
configurator 56, which is, in turn, responsive to control data from HEC 14. Configurator 56 
responds to control commands from HEC 14 to provide updated authorization information to 
router and formatter 43 for comparison with the incoming data and for adding/ subtracting 

20 database managers and databases and the like as EPG suppliers 1 8 and text channel suppliers 20 
are added and subtracted from the system. 

Access to ISP 16 is can be carefully controlled through the use of authorization codes. In 
addition, ISP 16 can maintain control over the information services provided to the viewer by 
storing the EPG data and text data in a particular format in the appropriate database within ISP 

25 16. 

Referring now to Figure 30, the EPG database key can be a combination of the date and 
time field and the channel number from the EPG data. Following these fields are the duration 
field, the repeat field, the program rating field, the program category field, the critique field, the 
attributes flag field, the program traits flag field, the text data compressed flag and lastly the text 
30 data. The text data field may further consist of several optional subfields with a delimiter 
between each field. 

EPG database manager 44 can access the EPG database 46 through shared library 

routines such as add a record, delete a record, read a record, and the like. As the EPG database 

46 is used, it can be fragmented as records are added and deleted, and as a result, EPG database 
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manager 44 may further include garbage collection routines for periodically performing the 
garbage collection function on the EPG database 46. The text databases are similarly configured 
except that garbage collection is not necessary. 

In one embodiment, EPG transaction formatter 58 reads the database records of EPG 
5 database 46 and formats them to program-based transactions having a predetermined number of 
bytes which are transmitted to the EPG scrambler 28 for insertion into the vertical blanking 
interval of a video signal and transmission to the set top box 34. These transactions are then sent 
via a transaction arbitrator 64 to the EPG scrambler 28 shown in Figure 28 for insertion into the 
appropriate video channel. 
10 The transactions from transaction arbitrator 64 can be output to a single RS-485 output 

port of ISP 16 which is connected to multiple scramblers of the type used to scramble premium 
cable channels. The transactions are segmented into EPG data and text data streams for 
transmission to the EPG scrambler 28 (if the transaction includes EPG data) or to the text 
channel scramblers 30 and 32 (if the transaction includes text data). 

15 In one embodiment, the EPG transactions generated by EPG a transaction formatter 58 

are formatted into SDLC frames as noted above. A sample SDLC format for the EPG 
transaction data is shown in Figure 31. In Figure 31, the beginning flag delineates the beginning 
of the SDLC frame, the station address delineates the scrambler to be addressed, the control byte 
is a command code that defines what is to be processed, the information field contains the EPG 

20 data formatted as in Figure 30, the frame check contains the CRC for all data between the 
beginning and ending flags, and the ending flag delineates the end of the SDLC frame. A 
transmission from EPG transaction formatter 58 will address a specific data stream and a 
response from the EPG scrambler 28 will identify its data stream in the station address location. 
As noted above, such transmissions may or may not require a response from the EPG scrambler 

25 28. 

The EPG transactions typically include an Add EPG Block command including a byte 
specifying that the following data is from the EPG data stream, a control code byte specifying, 
for example, whether a reply from the scrambler is expected, two bytes setting forth the EPG 
data block number, a flag setting forth whether the EPG data is Short Term or Long Term data, 
30 the number of transactions which make up the EPG data block, and the actual transactions. EPG 
transaction formatter 58 may also generate a Delete EPG Block command which specifies that 
the data is to be deleted from the EPG data stream, the control code byte, and the EPG block 
number to be deleted. 
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Figure 32 illustrates a flow chart for one embodiment of software that can be used and 
embodied in EPG transaction formatter 58. As shown, the software starts at step 500 and gets 
the system time and date from the ISP system clock 63 at step 502. An expired EPG data block 
is then deleted from the memory of the EPG scrambler 28 at step 504. An expired EPG data 
5 block is defined as a data block representing a program which has been completely aired prior to 
the current system time or a program which was aired before the time window used for the EPG. 
At step 506, current EPG data blocks having a time and date within the EPG time window are 
read from the EPG database 46. The current EPG data blocks are then formatted into Add EPG 
Block commands and associated transactions at step 508. A block/time map of EPG transaction 

10 formatter 58 is then updated at step 510. The block/time map preferably stores the time that each 
EPG data block was sent to EPG scrambler 28. The transactions representing the EPG data are 
then transmitted the EPG scrambler 28 at step 512. EPG transaction formatter 58 then waits at 
step 514 for the next EPG update (which should occur when the system time enters a new half 
hour) or the next EPG change (which may occur at any time). Upon receipt of such an update or 

1 5 change, control returns to step 502. Text transaction formatters 60 and 62 similarly generate text 
transactions for the text data, which as noted above, is defined on a per screen (rather than per 
program) basis. Hence, an Add Text Screen command is similar to an Add EPG Block command 
except that the text channel number and screen number are provided in place of the EPG block 
number and Short Term/Long Term data bytes. The text transaction formatters 60 and 62 may 

20 also request the time from the scrambler so that proper pagination may be maintained. 

Figure 33 illustrates a flow chart for one embodiment of software that can be used and 
embodied in text channel transaction formatters 60, 62. As shown, the software starts at step 600 
and reads a text screen record from the text database 50 or 54 at step 602. At step 604, the text 
screen is formatted into Add Text Screen transactions for transmission to the text channel 

25 scramblers 30, 32 at step 606. Preferably, such transactions are formatted such that the display 
characters are sent as display commands rather than as separate characters for every display 
coordinate of the text display screen. Then, at step 608, text transaction formatter 60, 62 waits a 
period of time specified by system manager 12 (if auto-pagination is used) before the next text 
page is formatted and transmitted to the text channel scramblers 30, 32. At the end of this period 

30 of time, control returns to step 604 and the next text screen record in the text database 50, 54 is 

formatted for transmission to text scramblers 30, 32 for insertion in the vertical blanking interval 

of a particular video signal. 

Text data can be passed to the screen by sending a separate character for each display 

location of a page. In other words, if a text screen comprises 16 lines and 4 characters per line, a 
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text screen is represented by sending 384 (24X16) characters, one for each display location for 
that display screen. A blank space character is sent to indicate that no character is present in a 
particular text screen location. 

In one embodiment, the text data is transmitted to the screen along with appropriate 
5 commands for controlling the display of the text data. For example, a first display command in a 
sequence identifies the following data as text data and instructs the set top box 34 to fill the 
television screen with a blue background or some other background or template over which the 
text will be displayed. The text data is then converted into a series of commands which together 
identify the separate screens of text. As noted above, the text data is grouped on a per screen 

10 basis, which allows the appropriate delay mechanism to be incorporated into the display 

commands to provide the necessary delay between the presentation of respective text screens. 

For this purpose, transaction formatters; 60 and 62 can include software for scanning the 
text data for actual characters, skipping extra spaces in the text data, and grouping the actual text 
for transmission in transactions of a designated size which will fit in the vertical blanking 

15 interval of a field of a video signal. Since spaces are eliminated, the display commands include a 
coordinate specifying the row and column address of the first display character on the screen and 
a number of contiguous characters follow that character in the same transaction until the 
transaction is filled or a number of successive spaces are encountered. 

Attribute information such as underline, blinking, or luminance inversion associated with 

20 the characters may also be transmitted using these display commands. These display commands 
are used to read the text data for a text screen from the appropriate database, and at the end of 
the text data for a text screen, a display command is transmitted to indicate that all data for that 
text screen has been transmitted. The transaction formatter 60, 62 also includes a wait loop or 
"timeout" command at the end of the transmission which builds in a delay (on the order of 7 

25 seconds) .which gives the viewer sufficient time to view a text screen before the text data for the 
next text screen is displayed, thereby providing auto-pagination of the text screen. 

Auto-pagination permits the viewer to automatically advance from one text screen to the 
next without any intervention by the viewer. In accordance with the auto-pagination scheme of 
the invention, the cable operator can specify the time duration between screens and forward this 

30 information to the transaction formatters; 60, 62. Then, during operation, when the viewer 

selects a text channel, the current page of text data is displayed by extracting the selected text 

channel data from the vertical blanking interval of the video signal in which it is inserted and 

mapping that text data to a channel of the viewer's television which is designated for display of 

that text channel. The next screen of text data will be displayed after a predetermined delay 
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which gives the viewer sufficient time to read the displayed text data for the current screen 
(approximately 7 seconds). This technique could replace the commonly used "barker" channel 
which uses a computer to generate text data which is then transmitted as a complete video 
channel over the.cable television system. 
5 Configurator 56 can respond to control commands from HEC 14 to provide updated 

authorization information to router and formatter 43 for comparison with the incoming data and 
to add/subtract database managers and databases and the like as EPG suppliers 18 and text 
channel suppliers 20 are added and subtracted 57 from the system. The control link between 
HEC 14 and configurator 56 can be used to report the status of the ISP 16 to system manager 12. 

1 0 Additionally, the control link may accept text data from system manager 1 2 for displaying 
system messages and the like. 

In one embodiment, the interface between configurator 56 and HEC 14 is an RS-232 port 
with a data format fixed at, for example, 9600 baud. All control data is preferably transmitted as 
ASCII characters. Upon receipt of a message from HEC 14, configurator 56 checks the data, 

1 5 performs the requested action, and returns a command response message in a message format of 
the type described above for communications between router and formatter 43 and the EPG and 
text channel suppliers. Sample commands sent from HEC 14 to configurator 56 over the control 
link include a Set Date and Time command (for synchronization purposes), Request 
Configuration commands, Request Status commands, Get Category Record commands, 

20 Scrambler Control commands, and Database Control commands. 

In one embodiment, during operation, ISP 16 monitors all input ports data from the EPG 
and text data service providers and builds a list of all available EPG and text data services. This 
list is sent to the system manager 12 using a Request Configuration command. This command 
specifies the available service providers, the type of service (EPG or text data) from each 

25 provider, the communications port associated with each service, the scrambler address or data 
stream (EPG or text data) for each service, the authorization code from the supplier for each 
service, the time and date of the last update from the service provider, the time and date of the 
last update to the scramblers, the time and date of the latest EPG data in the EPG database, and 
the like. Such information is provided to the system manager 12 for each service provider when 

30 this command is given. 

The Request Status command can contain flags indicating whether there are errors 

present in the error log and if the category list has changed since the last Request Status 

command was received. Get Error Record and Get Category Record commands may then be 

used to extract the error and category information. 
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In one embodiment, configuration commands are separated into EPG and text service 
configuration commands. A Configure EPG Service command specifies the service provider, the 
type of service, whether the service is to be enabled or disabled, the authorization code from the 
EPG supplier 1 8, the scrambler data stream for Short Term data, the scrambler data stream for 
5 Long Term data, the length of the Short Term data in hours (1-255), and the length of the Long 
Term data in hours (1-4096). The Configure Text Service command specifies the service 
provider, the type of service (weather, sports, and the like), whether this service is to be enabled 
or disabled, the authorization code from the text channel supplier 20, the scrambler address or 
data stream for the text data, the channel number, and the pagination delay time in seconds) 
10 before the next page of text data is to replace the current page of text data on the screen for auto- 
pagination. 

The scrambler control commands can include, for example, a Rebuild Scrambler 
Memory command which is used when a scrambler replaced and needs data to be reloaded in its 
memory and a Scrambler Configuration command for specifying the amount of scrambler 
15 memory in kilobytes. 

In one embodiment, the database control commands include, for example, a Clear 
Database command which is used to clear the database associated with a particular service and a 
Delete Database command which is used to delete the database associated with a particular 
service. Other database control commands such as a Download Category Map command may 
20 also be provided for establishing a list of the specified categories of program data in the EPG 
data. 

Figure 34 illustrates one embodiment of a set top box 34 that can be used with the 
present invention. As shown, set top box 34 includes EPG memory 36, template memory 38, 
text page memory 42, a set top box 700, and a set top processor 702 which reads commands 

25 from the vertical blanking interval of the incoming video signal and performs the appropriate 
action. For example, if the incoming command is a text channel definition or EPG definition 
command from HEC 14, the appropriate update of bit map 704 is performed. Similarly, if the 
incoming command is a display command including EPG data, that data is stored in EPG 
memory 36 and is displayed with the template stored in template memory 38 when the user 

30 makes a menu selection via television remote control unit 706 and remote receiver 708 

requesting display of the EPG data. The template data may be sent as part of EPG display 

commands if no template memory is provided. On the other hand, if the incoming command is a 

display command including text data, a page of that data is stored in text page memory 42 for 

presentation to the display a page at a time. The text page memory can be automatically updated 
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every few seconds by virtue of the delay built into the display commands from the text 
formatters 60, 62 if auto-pagination is enabled. Alternatively, the user can be allowed to 
manually access the text data in the memory. If manual access is provided, it is preferred that the 
text page memory hold at least the currently displayed text page, the previous text page and the 
5 subsequent text page in order to give the user the ability to scroll through the text data In either 
case, set top processor 702 preferably has the ability to request the next text page while the 
current page is being displayed so that the next text page is already loaded for display at the end 
of the screen delay time. The selected text, EPG or video signal is then modulated at modulator 
710 for display on television screen 40 at the channel specified in bit map 704. 

10 Bit map 704 of set top processor 702 of the set top box 34 maps the received text 

information to the designated cable channel for display by designating the frequency that must 
be tuned by box 700 to receive the desired text data. This information is received in the 
aforementioned text channel definition transactions from HEC 14. For example, the viewer may 
specify via television remote 706 that she wishes to view a sports text data channel which her 

1 5 program guide indicates to be available by tuning the set top box 34 to channel 181 . Set top 
processor 702 then checks bit map 704 for channel 181 to determine that it must tune the 
frequency for channel 29 in order to extract the sports text data for the viewer's channel 181 
from the vertical blanking interval of channel 29, set top processor 702 ten sets set top box 100 
to tune channel 29 but the video signal for channel 29 is not displayed. Instead, the video screen 

20 is blanked by set top processor 702 and the text data extracted from the vertical blanking interval 
by set top processor 702 is displayed. Any necessary descrambling of the received video is 
performed by set top processor 702. The viewer thus perceives that many more "virtual" 
channels are available even though a separate video channel was not used to transmit the text 
data. 

25 In one embodiment of the present invention, illustrated in Figure 42, electronic content is 

tagged and aggregated at a server and becomes electronic content with targeted information. 
Suitable electronic content includes but is not limited to, advertising, news segments, video 
programs, audio programs, WEB pages, and the like. The electronic content with targeted 
information is send to a broadcast server and broadcast widely. Set top box 34 has created 

30 profiles representing different people in the household. Set top box 34 matches the tags in the 
broadcast stream with the profiles created and determines which of the tagged content should be 
downloaded. A subset of all of the content that is broadcast that are relevant to the personalities 
in the household are downloaded and displayed to the user. 
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For example, set top box 34 can be used to match the tags in the broadcast stream with a 
demographic profile of a viewer previously created and determine which of the tagged content 
should be downloaded. An individual viewer receives from set top box 34 only.* Set top box 34 
matches the tags in the broadcast stream with the demographic profile of the viewer created and 
5 determines which of the tagged content should be downloaded. 

Some example of demographic characteristics include, but are not limited to, a user's 
gender, race, age and income. The output of the analysis of viewing habits of the representative 
sample provides a basis for determining demographic characteristics of the individual user. 

As shown, set top box 34 reads commands of the incoming tagged and aggregated 
10 electronic content and performs the appropriate action. For example, if the incoming command 
is a text channel definition or EPG definition command from HEC 14, the appropriate update of 
bit map 704 is performed. Similarly, if the incoming command is a display command including 
EPG data, that data is stored in EPG memory 36 and is displayed with the template stored in 
template memory 38 when the user makes a menu selection via television remote control unit 
1 5 706 and remote receiver 708 requesting display of the EPG data. The template data may be sent 
as part of EPG display commands if no template memory is provided. If the incoming command 
is a display command including text data, a page of that data is stored in text page memory 42 
for presentation to the display a page at a time. The text page memory can be automatically 
updated every few seconds by virtue of the delay built into the display commands from the text 
20 formatters 60, 62 if auto-pagination is enabled. Alternatively, the user can be allowed to 

manually access the text data in the memory. If manual access is provided, it is preferred that the 
text page memory hold at least the currently displayed text page, the previous text page and the 
subsequent text page in order to give the user the ability to scroll through the text data. In either 
case, set top processor 702 preferably has the ability to request the next text page while the 
25 current page is being displayed so that the next text page is already loaded for display at the end 
of the screen delay time. The tagged and aggregated electronic content is then modulated at 
modulator 7 1 0 for display on television screen 40 at the channel specified in bit map 704. 

Bit map 704 of set top box 34 maps the received tagged and aggregated electronic 
content to a designated cable channel for display. Set top processor 702 then checks bit map 704 
30 for channel 1 8 1 to determine that it must tune the frequency for channel 29 in order to extract 
the. Any necessary descrambling of the received tagged and aggregated electronic content is 
performed by set top processor 702. 
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The present invention also provides a mechanism to automatically create multiple 
profiles corresponding to multiple users with or without any information being explicitly 
providing by the users about themselves. Fig 35 shows an embodiment of the invention that 
causes the automatic creation of multiple profiles and automatic identification pf profiles when 
5 they are active on the device. 

In one embodiment, number of profiles to be created is initially determined. Several 
methods may be used to determine the number of profiles. In one embodiment of the current 
invention the number of profiles is optimally determined by applying the Minimum Message 
Length (MML) criterion. The process of applying MML criterion for determining the optimal 

1 0 number of clusters in described in reference (ref..). In one embodiment of the current invention, 
a user of the device explicitly specifies; the number of profiles to be created. 

The system monitors user interactions with the device. The user interactions may include 
but are not limited to channel change requests, requests to view more information, configuration 
of device parameters, requests to performs recording or deletion of programs from a storage 

1 5 device. All actions due to user interactions are recorded in a history database that stores history 
of viewer actions. Data records in the history database describing viewer action can be of 
different formats. Figure 37a shows one of the possible formats for the data record. Data records 
may contain information about the action, time at which the action occurred, and some 
parameters that further describes the action. In one embodiment of the invention, only actions 

20 matching certain criteria are recorded in the history database. For example, all user action about 
watching channels can be ignored if the duration of watching is less than a configurable 
threshold, or all user action about watching a particular channel like preview channel can be 
ignored. In the default usage of the device, the user will not be identifying himself or herself 
during each usage session. A usage session in this context can be the period during which there 

25 ' are user activities between two periods of inactivity, or the period during which the device is 
used between two periods during which the device is not in use. It will be left to the device to 
determine who the actual user is, in order to provide a very personalized environment to the 
user. In such a scenario the user action records will not contain any information about who the 
user is. In one embodiment of the current invention, the user can identify himself or herself 

30 explicitly for all or some of the usage sessions by using some mechanism provided by the 

system. These mechanisms may include pressing a sequence of keys, or choosing a user name 

from a menu and logging-in as that user. In this embodiment of the invention the user can 

identify himself or herself during certain sessions and may choose not to identify himself or 

herself during some other sessions. In the case where the user identifies himself or herself 
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certain user action records can have the user name or some other identification data as a 
parameter which specifies that a user was togged-in for the session from which the user action 
record was generated. 

By analyzing the user action history database that was generated for a period of time, the 
5 current invention provides a mechanism to create multiple profiles automatically. Each of these 
profiles may correspond to the entire preference of a single user, a group of related preferences 
of a single user, a group of related preferences of a group of users or the entire set preferences of 
a group of users. The mechanism for creating multiple profiles is described in figure 36. 

Sets of consecutive user action records are grouped together to form usage pattern 

10 records. Usage pattern records can be in the form of arrays of user action records. Only user 

actions that occur contiguously are grouped together in a single usage pattern record. The usage 
pattern records can be formed using many methods, some of which are: 

1 . Grouping together all user action records that are in a single usage session into a 
single usage pattern records. This is represented graphically in figure 38a. 

15 2. Grouping together all user action records that are in a single usage session into a 

one or more usage pattern records where each usage pattern record has a predetermined number 
of user action records. This is represented graphically in figure 38b. 

3. Grouping together a predetermined number of consecutive user action records in 
a usage session into one or more usage pattern records where each usage pattern record has a 

20 number of user action records which overlap with some of the user action records in an adjacent 
usage pattern record. This is represented graphically in figure 39. 

Each usage pattern record is mapped to a point in an N-dimensional space, each axis of 
this N-dimensional space representing a parameter that is of significance in identifying multiple 
profiles. The N-dimensions for this space are called cluster-dimensions. The parameters for this 

25 N-dimensional space are chosen either manually by people skilled at identifying the most 
significant parameters in identifying multiple profiles, or automatically by identifying the 
aspects of profiles that differs significantly between profiles. In one embodiment of the current 
invention a set of these parameters are identified and configured as the apparatus is configured 
during initialization, and a new set is added periodically by looking at aspects which differ in the 

30 multiple profiles that are stored in a device. A wide variety of combinations of initial parameters 

may be used. In one embodiment these initial parameters sire the channel names in a channel line 

up and viewing times. In the embodiment, these initial parameters are possible values of 

program description fields of television programs. In one embodiment the rate of channel change 

is also one of the parameters. The mapping of the usage pattern record to a point in the space 

.47. 



BNSDOCID: <WO 01 17250A1 JA> 



»*V<I . ■"■ 7 ------ 



WO 01/017250 PCT/US00/23868 

defined by cluster-dimensions can be done by many methods. One method determines the 
quantity of a particular characteristic which is defined by a clustering parameter, exhibited in a 
usage pattern record, e.g. number of channel changes in a usage pattern, and uses this as the 
value of the axis- for the corresponding dimension. One method determines the rate of 
5 consumption of a particular characteristics in a usage pattern record, e.g. rate of consumption for 
"NBC" in a usage pattern record is .5 which indicates that programs on "NBC" were watched 
50% of times during the period of this usage pattern record. 

All the points in the space defined by cluster dimension are clustered into a number of 
clusters using standard clustering algorithms. Any clustering algorithm can be used to perform 

1 0 the clustering. In one embodiment EM clustering is applied to group the points in the cluster- 
dimension space into a predetermined number of clusters. The number of profiles being created 
decides the number of clusters formed. Anyone skilled in the art of artificial intelligence and 
clustering, especially EM clustering, will be familiar with these clustering techniques. Each 
cluster formed as a resulting of the clustering represents a single profile. The clustering process 

1 5 also provides the mixture weights for each of the clusters as one of the outputs. The mixture 

weight for a cluster can be used to compute the percentage of time the profile was active, of the 
total amount of time for which the device was used. 

In one embodiment of the present invention, the clustering is performed periodically 
using the user actions records accumulated in the history database. In one embodiment, user 

20 action records are periodically removed from the history database based on certain criteria such 
as the age of the record, size of the record, relevancy of the record etc. 

The current invention also provides a mechanism to predict profile that is active at any 
given time. Figure 40 illustrates the method for identifying the profile currently active on the 
device. Current user actions are monitored and user action records are created. The most recent 

25 user action records are used to create a usage pattern record. The usage pattern record is mapped 
to a point in the cluster dimension. Using the Information about clusters representing multiple 
profiles, the, probability of each of the clusters being active given the usage pattern is computed. 
This is computed by using Bayesian Probability theory that can be used to compute the posterior 
probability using the prior probability and mixture weights. The probability of a cluster being 

30 active is computed using the probability of the usage-pattern record given the profile 

(P/usage_pattern profile), the probability of the profile being active and the probability of the 

usage record occurring. The mathematical representation for the Bayesian Probability theory in 

the current context is given below 

P(profile/usage_pattern) = P(usage_pattern, profile) * P(profile)) P(usage_pattern) 
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In this equation probability of usage pattern record occurring given profile (P(usage- 
pattern, profile)) can be computed by knowing the probability distributions governing the 
clusters. In one embodiment, the cluster probability distribution can be assumed to be the 
Gaussian probability distribution, so that the P(usage_pattem profile) can be computed using the 
5 cluster center and cluster variation. Cluster center and the cluster variation are the output 
generated by the clustering process performed for generating multiple profiles. 

In one embodiment, switching the device on can be one of the user actions recorded and 
"time of usage" one of the clustering dimension. In this .embodiment, as soon as a user switches 
the device on, the probability for any cluster being active can be determined, even with out any 

10 further user actions. As the user performs more user actions, the current usage pattern record is 
refined to include more user action records and the probability of profiles which may be active is 
refined with the addition of each new user action record. As more user action records are added 
to the current usage pattern, a set of the oldest user action records may be removed from the 
current usage pattern record. This process ensures that the effect of individual user actions in the 

1 5 identification of active profiles decreases as time passes. 

In one embodiment of the current invention, the clusters generated by the clustering 
procedures are used to create multiple profiles using a method described in Figure 41 . In this 
embodiment, the clusters are used to compute the probability of each clusters being active 
during the period of each usage pattern record. User preferences for each profile are created 

20 using processes described above using the user actions weighted by the probability of the 
corresponding cluster being active. 

In yet another embodiment of the present invention, multiple profiles may be created for 
each user. Each of such multiple profiles may be used to describe the user's viewing preferences 
at various time periods, such as weekday or weekend, or morning or evening. The profiles may 

25 * also vary according to other variables, such as showing more sports during football season, more 
gardening shows during spring, more political commentaries every four years during the 
primaries, etc. Thus, once the system of the invention is activated and the user who activated the 
system is recognized, the system may further determined the time of day and/or week and access 
the appropriate profile for the user. 

30 With reference now to Figure 43, in another preferred embodiment of the present 

invention used to deliver advertising content to specifically targeted viewers based upon their 

individual viewer profile, the broadcast network amasses content from the many different 

content providers and combines it at a network broadcast head end 29 into a broadcast signal 

that is transmitted to, and received by, the television control system 2 of the present invention. 
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The control system 2 sorts through the content received from the broadcast network based upon 
the profiles of the different viewers in the particular household, compiles lists of preferred 
programs available to each user, records and stores programs of interest, and makes the 
programs available for viewing when required. Equally important, the control system 2 receives 
5 and sorts through various advertising content and stores the content most relevant to the viewers 
in the house hold based upon their demographic profiles and the target audience of the 
advertising content. 

With greater particularity, and as shown in Fig. 43, content providers include EPG 
suppliers 1 8, text channel suppliers 20, conventional programming suppliers 23, and alternative 

10 content suppliers 25. As described below, alternative content refers to highly targeted 

programming that may be delivered through the use of a control system 2 as described herein. 
Conventional programming encompasses all types of programming currently available and 
including, but not limited to, movies, news, sport events, educational programs, pay-per-view 
shows, and shopping channels. Advertising content suppliers 19 provide promotional material 

1 5 such as short video clips or graphical and text information to programming suppliers 23 for 
inclusion in their conventional programming content, in a manner familiar to TV viewers 
worldwide. However, in a novel method of delivering advertising content uniquely available 
through the use of the present invention, advertising content may be supplied to alternative 
content suppliers (see discussion below) and may also be profiled through a profiler 21 to 

20 develop meta data descriptive of each individual advertising message. As discussed elsewhere in 
the specification, the meta data refers to demographic characteristics that describe the viewers to 
whom each advertisement is specifically targeted. 

The profiler 2 i may be a software based system or may rely upon human intervention to 
watch and analyze each advertising message and develop meta data descriptive of the 

25 advertisement. Alternatively j the advertising content suppliers 19 may provide this information 
along with the advertisement. The meta data is developed based upon the parameters of interest 
to the clients for whom the ads are developed, and will typically include the age, gender, ethnic 
background, education, profession, and other demographic information related to the viewer. 
The meta data will thus describe the viewers to which each advertising message is targeted in 

30 terms of the viewers' demographic information as defined by the parameters mentioned 

previously. The meta data thus generated is then appended to each advertisement and the thus- 
tagged advertisements are provided to the broadcast network for inclusion in the broadcast 
signal. 
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The tagged advertisements will preferably be provided in a plurality of channels, wherein 
each channel may carry a specific type of advertisement in terms of the target audience. Thus, 
each channel may be defined according to one demographic parameter or an often recurring 
combination, e.g. a channel with ads for men, a channel with ads for women, a channel with ads 
5 for male sports fans, etc. Alternatively, each channel may carry a stream of all types of ads, 
wherein each type of ad is present in a ratio dependent upon the demographic parameters to 
which it is targeted. Thus, ads targeted to male sports fans on weekend afternoons will be more 
numerous than ads targeted to young children, because of the large number of sports events 
broadcast at such time and the high probability that young children will be playing outside rather 

10 than watching TV indoors at such times. Although providing tagged advertising content via a 
plurality of channels is not a requirement of the invention, it is believed that the sheer amount of 
advertising content available will require multiple channels as a practical matter. Furthermore, 
providing channels in a pre-sorted manner may reduce the amount and complexity of the sorting 
carried out by the control system 2 (as more fully described below). 

1 5 Also as shown in Fig. 43, the signal broadcast by the network may further include 

» 

information provided by the administrator of the service provided by the invention, e.g. 
Metabyte Inc., the assignee of this application. As described in more detail elsewhere in the 
application, this information may include updates of the software of the control system 2. 

With continued reference to Fig. 43, the network broadcast signal is transmitted to, and 

20 received by, the control system 2. The broadcast signal may be analog or digitally encoded, and 
may be transmitted via cable, satellite, telephone line, or any other practicable manner. In view 
of the large number of channels typically available and the state of the art, practical 
considerations will most likely dictate that the network signal be broadcast in digital form, in 
which, case both the network head end 29 and the control system 2 will need to incorporate ADC 

25 and, E? AC circuitry, in a manner very well known to those skilled in the art. Furthermore, it is 
understood that the control system 2 will need to incorporate channel tuning circuitry that, 
although not shown in the figure, will be necessary to separate the numerous signals multiplexed 
in the broadcast signal and select any one of these signals (i.e. channels) as necessary. 

Once received by the control system 2, the broadcast signal is supplied to various 

30 components of the system. The program switch 1 14 responds to the viewers input as provided 

via a remote control or similar unit to select the desired channel and direct the channel signal to 

the TV monitor 108. As described previously, the preference agent 1 10 monitors the viewing 

selection of the various viewers using the control system 2 and creates viewing profiles of each 

viewer that are stored in the preference database 1 16. Based upon these profiles, the preference 
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agent 1 1 0 sorts through the incoming programming content as described in the EPG information 
to compile lists such as "Top 10" lists of viewing choices available at any given time to each 
viewer, and directs the recording manager 1 12 to record the top-ranked program being broadcast 
at any given time (including any programs selected by the viewers for recording) and store it in 
5 the stored programs memory device 35. 

The preference agent further contains software that allows it to create a demographic 
profile for each viewer, based upon the viewing profile of the viewer and certain algorithms or 
associative rules. These algorithms may be adjusted over time as the model employed by the 
system administrator 27 is enhanced and its accuracy improves. To this end, the system update 

10 information channel included in the broadcast signal may include periodic software updates, 

including new preference database parameters that may need to be included at the request of the 
advertising suppliers 19. Thus, in one embodiment the control system 2 of the invention may be 
remotely upgraded to meet any new demands that may arise as advertising content providers 
become familiar with the system and the process of custom tailoring narrowly focused, targeted 

1 5 advertisements. The demographic profile created for each viewer is stored in the demographic 
database 31, which resides in the control system 2 and thus ensures the viewers privacy. 

The preference agent 1 10 also sorts through the advertising content streaming in through 
multiple advertising channels contained within the broadcast signal and, based upon the 
demographic profiles of the viewers and the meta data contained in each advertisement to 

20 describe the target audience for the particular advertisement, stores and/or causes the display of 
particular advertisements. The control system 2 may utilize any of a variety of methods to 
manipulate the advertising content, as described below. 

In one embodiment of the invention, the advertising channels each carry the same type of 
advertising. The preference agent determines which viewer is watching TV at any given time 

25 and stores in stored ads memory device 33 those tagged advertisements that are targeted to the 
particular viewer. At appropriate times during the program that is being watched by the viewer, 
such as during the commercial breaks typically inserted in most TV programs, the preference 
agent directs the program source switch 1 14 to access the stored ads and play a selected 
advertisement on the TV monitor 108. If the program being watched by the viewer contains 

30 information regarding the length of the commercial break, the preference agent may select stored 

ads of appropriate length to insert in the allotted time slot. The preference agent may further 

keep track of the ads that have been previously played to ensure that all stored ads are displayed 

equally. Alternatively, tagged ads may contain the desired number of times that the 

advertisement provider wants the ad to be aired during any given day, or perhaps the specific 

.52- ' " ~ * ^ . 



BNSDOCID: <WO 01 17250A1JA> 



WO 01/017250 PCT/US00/23868 

times at which the ad should be shown. Thus, after a number of ads having been stored, e.g. 
sufficient for a 24 hour period, the preference agent may review all of the stored ads meta data 
and develop a strategy for showing these ads to the viewer including when and how often. 

Alternatively, the preference agent may cause a certain number of ads to be displayed 
5 and direct the recording manager 1 12 to record the selected program if the stored ads being 
displayed run longer than the allotted time slot for the commercial break. The control system 2 
of the present invention can therefore manipulate the broadcast schedule of a program to a 
certain extent by modifying the amount of advertising content that the viewer is subjected to. 

In another embodiment, the advertising channels may be operational for only a limited 
10 time during the day, typically at off-peak hours (e.g. 2 a.m.), during which all advertisements for 
the coming day may be downloaded and stored. Thus, an advertising channel may be turned on 
at a certain time and stream all advertisements to the control system during the space of an hour 
or two. Alternatively, a regular programming channel may suspend programming for an hour or 
two at a convenient time and supply the advertisements for the following day. In yet another 
1 5 alternative, a dedicated connection such as a phone line or an internet connection may be used 

* 

for periodic downloads of advertisements. 

In an alternative embodiment, multiple advertising channels carry a mixture of 
advertisements such that at any given time the preference agent has the option of selecting at 
least one advertisement targeted to the viewer from one of these channels. In this embodiment 

20 the control system need only to store one advertisement at any given time to ensure continuity 
between the program being watched and the advertisements. Thus, by way of example, the 
commercial break in the program may occur at a time that does not correspond to the beginning 
of an advertisement on any of the advertising channels. In this case, the advertisement stored by 
the control system is directed through the program source switch to the TV monitor while 

25 another targeted advertisement is concurrently stored for subsequent display. If the commercial 
break happens to coincide with the start of a targeted advertisement on any of the advertising 
channels, the preference agent can simply cause the program source switch to direct the 
particular channel to the TV monitor while another advertisement from another advertising 
channel is being stored. 

30 In another embodiment, each regular programming channel can carry multiple, 

multiplexed versions of an advertisement. When a commercial break occurs, the preference 
agent selects the most appropriate version of the advertisement and directs it to the program 
source switch. This embodiment would require additional circuitry to de-multiplex the various 
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versions of each add and apply the particular version selected by the preference agent to the 
program source switch. 

The novel method of delivering targeted advertising that is provided by the invention is 
extremely advantageous to the advertising community. In one aspect, the invention allows 
5 greater freedom in producing advertisements because they no longer need to be appealing to as 
wide and varied an audience. Furthermore, because the alternative advertisements are delivered 
in real time, all viewers are reached by individualized advertisements at the same time , thus 
providing significant time savings in comparison with the prior art, wherein five different 
advertisements would require five different time slots during which to be broadcast. The cost 
1 0 savings thereby realized, especially during very popular events such as the Super Bowl, can be 
significant. 

As mentioned above, preference agent 1 10 may sort through available channels to select 
the ten (or any other number) programs currently playing that most closely match the viewer's 
viewing profile. In addition, the preference agent may also build customized listings of future 

1 5 programs based upon the viewer's profile as well as any additional criteria specified by the user. 
In a preferred embodiment the user will have the ability to fully customize his viewing profile, 
including the values assigned to the different parameters that make up his or her viewing profile. 
In an alternative embodiment the user may even be allowed to specify what kind of advertising 
he prefers. Such a configuration would likely generate an alternative billing arrangement, 

20 whereby the user agrees to watch advertisements in exchange for the ability to specify the types 
of advertising he wishes to be subjected to and perhaps some sort of financial incentive. 

Another novel possibility engendered by the present invention is the creation of a 
'personal channel' that is always showing the most interesting program being broadcast at any 
given time and/or previously recorded and stored programs that were very close matches to the 

25 viewer's profile. Thus, turning on to the personal channel will always guarantee the viewer the 
most individualized, interesting viewing experience available based upon the programs 
broadcast during a certain preceding time period that the viewer may specify. The personal 
channel may thus show a collage of movies, news segments, sports events, and any other 
programming content that was broadcast during the preceding 24 hours and that matched the 

30 viewer's profile to within a specified degree. 

An additional element of control system 2 that may be incorporated only in selected 

control systems is a privacy filter 37 that deletes any personal information from the demographic 

database and transmits this anonymous information to the system administrator 27 for purposes 

of maintaining and updating the model used to generate the demographic database. Such models 
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of the control system will require a feedback line for transmitting the demographic information 
to the system administrator, and in a preferred embodiment is a telephone line or a dedicated 
internet line such as DSL, cable, etc. The system administrator may offer financial or other 
incentives to users to convince them to supply this type of information. By carefully selecting 
5 users across a wide demographic cross section, the system administrator can use the information 
thus gathered to enhance the model used to develop a viewer's demographic profile based upon 
his or her viewer profile by comparing these users' actual demographic data with the 
demographic profile developed by the preference agent. 

Due to the narrowly focused nature of the advertising delivered by the invention, the 

10 targeted advertising developed for delivery by the control system of the invention may include 
novel elements such as coupons or highly targeted descriptions. Thus, targeted advertising 
developed for distribution via the present invention may include specific information, couched 
in specific language, that is intended to be especially appealing to the target audience. 
The invention further allows the creation of highly targeted content other than 

1 5 advertisements that can be delivered only to a very specific audience. Thus, movies, shows, 
religious programs, video magazines, infomercials, etc., may be developed to reach a very 
specific audience without the restrictions typically imposed on the content developers when the 
program will reach, or at least be accessible to, a very wide audience. This embodiment will 
require that such specific content be supplied via dedicated channels that cannot be tuned to 

20 directly by a conventional TV tuner, and thus may only be accessed through the control system 
2. Such highly targeted content may be provided by alternative content suppliers 25 or even be 
developed for alternative distribution by conventional programming suppliers 23. Thus, use of 
the present invention may create a new distribution medium that will allow the content providers 
to not only, reach a very specific audience but also to remotely, automatically exclude certain 

25 segments of the audience from accessing this material. Such alternative content could be 
broadcast exclusively on a viewer's personal channel, as described previously. 

The present invention thus enables the delivery of highly targeted content to a viewer 
that includes not only advertisements, but other programming and content. With reference now 
to Fig. 44, a time line 800 is shown tracking the progress of a typical television program 850. In 

30 conventional linear programming as currently available to TV viewers from broadcast as well as 

cable/DSS stations, program 850 is composed of scenes such as Scene 1, Scene 2, etc., and 

advertisements such as Ad 1 , Ad 2, etc. In accordance with conventional linear programming, 

the scenes are shown sequentially as predetermined by the producer of the program, and are 

interspersed with showing of the advertisements as determined by the head-end operator. In 
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accordance with the invention, customized linear programming is enabled due to the 
development of demographic and/or viewer preference profiles for each user. Thus, a 
customized linear program 860 may still be comprised of scenes interspersed with 
advertisements, but some or all of the scenes as well as the advertisements may now be targeted 
5 to various demographic or viewing traits of the user. Thus, Scene 2 may actually have been shot 
in two different versions by the producer of the television program, Scene 2 and Scene 2a, 
wherein each version of the scene is more appropriate, or of more interest, or less offensive, to a 
particular target audience. Thus, in one possible embodiment, Scene 2a may be a less violent or 
less graphic version of Scene 2. The system of the present invention receives TV signals 
1 0 carrying both scenes, and selects among the available alternatives based upon the profile of the 
currently watching user and the target data of the alternative scenes. Program 860 as shown in 
Fig. 44 is thus shown to the user in a linear, seamless manner as comprising Scene 1, alternative 
Scene 2a, alternative Scene 3b, alternative Ad la, alternative Ad 2c, Scene 4, Ad 3, and 
alternative Ad 4b. 

15 The same matching and selection process occurs with Scenes 3, 3a and 3b, as well as the 

advertisements. Thus, Ad 1 may be provided in four variations, each targeting a different 
demographic characteristics, be it the users sex, income, geographic location, or any other 
desired characteristic. As previously mentioned, providing such targeted ads provides a more 
interesting and enjoyable watching experience for the user, and thus enhances the likelihood that 

20 the user will watch the advertisement rather than switch channels or walk away. 

It is important to point out that the method of targeted advertising enabled by the 
invention is different from providing advertising messages by a programming guide. The present 
invention enables showing advertisements in an apparently conventional, linear manner whereby 
the broadcast of a television program is halted periodically by the head-end to broadcast 

25 advertisements. The advertisements thus shown are targeted to the individual user watching a 

particular TV set, but the user is not prompted in any manner to select between alternative ads or 
in any other way informed of the fact that alternative ads are being received by the set-top box. 
This is a different manner of providing targeted advertisements from providing targeted 
advertisements by a programming guide because, in a conventional programming guide context, 

30 the TV screen is typically split in at least two different areas, wherein one area shows program 
listings and the other area displays advertisements. Thus, in the context of a program guide, the 
user does not receive any programming at all, but rather only program listings and 
advertisements. 
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As shown in Fig. 44, alternate scenes and advertisements are preferably of the same 
length and thus occupy the same time slots within the program time line. However, it also 
possible to provide alternative advertisements and scenes of different lengths, such as those 
comprising television program 870. In this embodiment, the set top box would additionally 
5 fulfill the role of scheduler by calculating the length of each scene and advertisement and 
scheduling the scenes and advertisements for showing to the user in a seamless, apparently 
linear program with no interruptions or overlaps. In this embodiment both the scenes and the 
advertisements may be comprised of any combination of broadcast and previously recorded, 
stored scenes and advertisements. Thus, by way of example, in program 870 alternative Scene 

10 2a may be shorter than alternative Scene 2, which would necessitate showing Scene 3 b and Ad 
la at an earlier time. In this case, an additional Ad A may be shown between Ad la and Ad 2c, 
where Ad A is selected by the set top box because its target profile fits the demographic profile 
of the user and because it is of the appropriate length of time. 

Providing a customized program will also be able to reach a wider audience by catering 

1 5 more individually to specific traits and characteristics of the general viewing population. 

Furthermore, providing such customized content may also allow greater artistic and expressive 
freedom for producers of programming, as they will be able to explore different variations of the 
same story line and edit various scenes for various audiences. Another possible use for the 
customized linear programming enabled by the present invention is providing highly 

20 individualized services such as news, weather, stock market quotes, shopping events, 
instructional videos, etc. 

The popularity and acceptance of the system of the present invention will depend largely 
upon the cost to the end users, i.e. the viewers. As such, in one preferred financial arrangement, 
the user pays a set price for the control system, i.e. the hardware, connects the control system to 

25 his TV and incoming cable or satellite dish line, and enjoys the personalized services provided 
by the control system at no further cost. The system administrator will provide tagged 
advertisements for broadcast by the broadcast network and charge a fee to the advertisement 
client. The actual advertisements may be provided by an advertising content supplier or the 
advertising client itself (e.g. a truck manufacturer), to which the system administrator tags the 

30 meta data. The fee may be based upon the total number of control systems installed and/or the 

target audience that the advertising client wishes to reach. Thus, if a client wishes to air 

advertisements that are tagged to reach a relatively wide audience of viewers who have 

purchased and presumably installed the control system of the invention, the fee charged will be 

proportionally higher than the fee charged for a more narrowly focused advertisement. 
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Having now described the invention in accordance with the requirements of the patent 
statutes, those skilled in this art will understand how to make changes and modifications in the 
present invention to meet their specific requirements or conditions. Such changes and 
modifications may be made without departing from the scope and spirit of the invention as set 
5 forth in the following claims. 
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What is claimed is: 

1 . A method to deliver customized linear video programming to each of a plurality of 
individual viewers, comprising: 

processing information indicative of preferences of each of the plurality of viewers to 
5 develop viewer characteristics information for each of the viewers; and 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, to create an apparently linear program for linear delivery to the viewer in accordance 
with the viewer characteristics information. 

10 

2. The method of claim 1 , wherein configuring a set of video programming segments 
further comprises: 

selecting at least one broadcast video programming segment for linear delivery to the 
viewer concurrent with the broadcast of the video segment to create the apparently linear 
15 program. 

3. The method of claim 1 , wherein configuring a set of video programming segments 
further comprises: 

selecting at least one video programming segment stored on a storage medium for linear 
20 delivery to the viewer to create the apparently linear program. 

4. The method of claim 1, wherein processing information indicative of preferences of each 
of the plurality of viewers comprises: 

processing information indicative of television program viewing preferences of each of 
25 the plurality of viewers. 

5. The method of claim 4, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television programs watched by each of the plurality 
30 of viewers. 

6. The method of claim 4, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 
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processing information indicative of television programs recorded by each of the 
plurality of viewers. 

7. The method of claim 4, wherein processing information indicative of television program 
5 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television programs not watched by each of the 
plurality of viewers. 

8. The method of claim 4, wherein processing information indicative of television program 
1 0 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television program guide information requested by 
each of the plurality of viewers. 

9. The method of claim 4, wherein processing information indicative of television program 
1 5 viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of television program guide information not requested 
by each of the plurality of viewers. 

10. The method of claims 5, 6, 7, 8, or 9, wherein processing information comprises: 
20 processing electronic program guide information. 

1 1 . The method of claim 1, wherein processing information indicative of preferences of each 
of the plurality of viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers 
25 provided by each of the viewers in response to queries. 

12. The method of claim 1, wherein processing information indicative of television program 
viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of preferences other than television program viewing . 
30 preferences of each of the plurality of viewers. 

13. The method of claim 12, wherein processing information indicative of preferences other 
than television program viewing preferences of each of the plurality of viewers comprises: 
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processing information indicative of musical preferences of each of the plurality of 
viewers. 

14. The method of claim 12, wherein processing information indicative of preferences other 
5 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of reading preferences of each of the plurality of 
viewers. 

15. The method of claim 1 2, wherein processing information indicative of preferences other 
10 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of shopping preferences of each of the plurality of 

viewers. 

16. The method of claim 12, wherein processing information indicative of preferences other 
1 5 than television program viewing preferences of each of the plurality of viewers comprises: 

processing information indicative of preferences other than television program viewing 
preferences of each of the plurality of viewers acquired from the group of sources comprising 
on-line music clubs, on-line book clubs, on-line special interest clubs and organizations, and on- 
line retailers and merchants. 

20 

17. The method of claim 1, 4, 1 1, or 12, wherein processing information indicative of 
preferences of each of the plurality of viewers to develop viewer characteristics information for 
each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of 
25 develop Vtelevisidh program viewing preference profile for each of the viewers. 

1 8. The method of claim 1 7, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers comprises: 

30 processing information indicative of preferences of each of the plurality of viewers in 

accordance with a predictive model based on the viewing habits of a representative sample of 
the general population to develop a television program viewing preference profile for each of the 
viewers. 
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1 9. The method of claim 1 8, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a probabilistic model based on the 
viewing habits of a sample of the general population comprises: 

constructing a Bayesian network to calculate maximum a posteriori values for the 
5 parameters of the predictive model to predict television program viewing preferences for each 
viewer. 

20. The method of claim 17, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a television program viewing preference profile for 

1 0 each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
deduce hidden traits of each viewer. 

2 1 . The method of claim 1 7, wherein processing information indicative of preferences of 

1 5 each of the plurality of viewers to develop a television program viewing preference profile for * 
each of the viewers comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
deduce associated traits of each viewer. 

20 22. The method of claim 17, further comprising: 

processing the television program viewing preference profile developed for each of the 
viewers to develop demographic information for each of the viewers. 

23 ; The method of claim 22, wherein processing the television program viewing preference 
25 profile developed for each of the viewers to develop demographic information for each of the 
viewers comprises: 

processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model based on the viewing habits of a representative 
sample of the general population to develop demographic information for each of the viewers. 

30 

24. The method of claim 23, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a predictive model based on the 
v iewing habits of a representative sample of the general population comprises: 
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processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model that predicts each demographic trait of the viewer 
based on viewer preferences for television programs that attract a higher proportion of viewers 
exhibiting the demographic trait than is exhibited by the representative sample.of the general 
5 population. 

25. The method of claim 24, wherein processing the television program viewing preference 
profile developed for each of the viewers in accordance with a predictive model that predicts 
each demographic trait of the viewer based on viewer preferences for television programs that 

1 0 attract a higher proportion of viewers exhibiting the demographic trait than is exhibited by the 
representative sample of the general population further comprises: 

processing the television program viewing preference profile developed for each of the 
viewers in accordance with a predictive model that predicts each demographic trait of the viewer 
based on viewer preferences for television programs that attract a higher proportion of viewers 

1 5 exhibiting the demographic trait than is exhibited by the representative sample of the general 

# 

population and that exhibit minimal demographic trait correlation with other television programs 
that attract a higher proportion of viewers exhibiting other demographic traits than is exhibited 
by the representative sample of the general population. 

20 26. The method of claim 1 , wherein processing information indicative of preferences of each 
of the plurality of viewers to develop viewer characteristics information for each of the viewers 
comprises: 

processing information indicative of preferences of each of the plurality of viewers to cfetfeJo 
demographic information for each of the viewers. 

25 • 

27. The method of claim 26, wherein processing information indicative of preferences of 

ifevelop 

each of the plurality of viewers to Y demographic information for each of the viewers comprises: 

processing information indicative of preferences of each of the viewers in accordance 
with a predictive model based on the viewing habits of a representative sample of the general 
30 population to develop demographic information for each of the viewers. 

28. The method of claim 27, wherein processing information indicative of preferences of 

each of the viewers in accordance with a predictive model based on the viewing habits of a 

representative sample of the general population comprises: 
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processing information indicative of preferences of each of the viewers in accordance 
with a predictive model that predicts each demographic trait of the viewer based on viewer 
preferences for television programs that attract a higher proportion of viewers exhibiting the 
demographic trait than is exhibited by the representative sample of the general population. 

.5 

29. The method of claim 28, wherein processing information indicative of preferences of 
each of the viewers in accordance with a predictive model that predicts each demographic trait 
of the viewer based on viewer preferences for television programs that attract a higher 
proportion of viewers exhibiting the demographic trait than is exhibited by the representative 

1 0 sample of the general population further comprises: 

processing information indicative of preferences of each of the viewers in accordance 
with a predictive model that predicts each demographic trait of the viewer based on viewer 
preferences for television programs that attract a higher proportion of viewers exhibiting the 
demographic trait than is exhibited by the representative sample of the general population and 

1 5 that exhibit minimal demographic trait correlation with other television programs that attract a * 
higher proportion of viewers exhibiting other demographic traits than is exhibited by the 
representative sample of the general population. 

30. The method of claim 26, wherein processing information indicative of preferences of 
20 each of the plurality of viewers tc^demographic information for each of the viewers further 

comprises : dfc Vfc p 

developing a television program viewing preference profile for each of the viewers in 
accordance with the viewer demographic information. 

25 31. The method of claim 1 7 or 30, wherein processing information indicative of preferences 
of each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers further comprises: 

processing information indicative of preferences of each of the plurality of viewers to 
develop a plurality of television program viewing preference profiles for each of the viewers. 



30 



32. The method of claim 31, wherein processing information indicative of preferences of 
each of the plurality of viewers to develop a plurality of television program viewing preference 
profiles for each of the viewers comprises: 
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processing information indicative of preferences of each of the plurality of viewers to 
develop a plurality of television program viewing preference profiles for each of the viewers, 
each profile describing the television program viewing preferences of the viewer at a different 
time of day, time of week, or season. 



33. The method of claim 17 or 30, wherein processing information indicative of preferences 
of each of the plurality of viewers to develop a television program viewing preference profile for 
each of the viewers further comprises: 

processing information indicative of preferences of each of a plurality of viewers 
10 accessing the same television equipment to develop a plurality of television program viewing 
preference profiles for each of the viewers. 

34. The method of claim 33, wherein configuring a set of video programming segments 
comprises: 

1 5 configuring a set of video programming segments for the viewers accessing the same 

television equipment, at least one of the video programming segments selected from a plurality 
of available video programming segments, to create an apparently linear program for linear 
delivery to the viewers in accordance with characteristics information of the viewers. 

20 35. The method of claims 1 7 or 30, wherein configuring a set of video programming 
segments for each viewer comprises: 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, for recording in accordance with the television program viewing preference profile of 

25 the viewer. 

36. The method of claim 1, wherein configuring a set of video programming segments 
comprises: 

configuring a set of video programming segments selected from the group of video 
30 programming segments comprising advertising, entertainment, news, weather, financial, sports, 
educational, and shopping programming. 

37. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: 
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configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, for recording in accordance with the viewer characteristics information. 

5 38. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: 

configuring a set of video programming segments for each viewer, at least one of the 
video programming segments selected from a plurality of available video programming 
segments, to create an apparently linear program for linear delivery to the viewer on at least one 
10 dedicated channel in accordance with the viewer characteristics information. 

39. The method of claim 36, wherein configuring a set of video programming segments 
further comprises: 

presenting a listing of the set of video programming segments to the user for the user to 
15 select therebetween. 

40. The method of claim 1, wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
20 video programming segments to create an apparently linear program for linear delivery to the 

viewer, the program exhibiting content customized in accordance with the viewer characteristics 
information. 

41. The method of claim 1, wherein configuring a set of video programming segments 
25 further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer characteristics information. 

30 42. The method of claim 41 , wherein selecting one or more of the video programming 
segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
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viewer, the program exhibiting content targeted to the viewer characteristics information by the 
providers of the video programming segments. 

43. The method of claim 1 7, wherein configuring a set of video programming segments 
5 further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content customized in accordance with the viewer television 
program viewing preference profile. 

10 

44. The method of claim 17, wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
15 viewer, the program exhibiting content targeted to the viewer television program viewing 
preference profile. 

45. The method of claim 44, wherein selecting one or more of the video programming 
segments comprises: 

20 selecting one or more of the video programming segments from a plurality of available 

video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer television program viewing 
preference profile by the providers of the video programming segments. 

25 46. The method of claim 45, wherein selecting one or more of the video programming 
segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
advertising video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
30 television program viewing preference profile by the providers of the advertising video 
programming segments. 

47. The method of claim 26, wherein configuring a set of video programming segments 
further comprises: 
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selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content customized in accordance with the viewer demographic 
information. 

5 

48. The method of claim 26, wherein configuring a set of video programming segments 
further comprises: 

selecting one or more of the video programming segments from a plurality of available 
video programming segments to create an apparently linear program for linear delivery to the 
1 0 viewer, the program exhibiting content targeted to the viewer demographic information. 

49. The method of claim 48, wherein selecting one or more of the video programming 
segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
1 5 video programming segments to create an apparently linear program for linear delivery to the 
viewer, the program exhibiting content targeted to the viewer demographic information by the 
providers of the video programming segments. 

50. The method of claim 49, wherein selecting one or more of the video programming 
20 segments comprises: 

selecting one or more of the video programming segments from a plurality of available 
advertising video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
demographic information by the providers of the advertising video programming segments. 

• 25 ■ 

5 1 . The method of claims 40, 41, 43, or 47, wherein configuring a set of video programming 
segments comprises: 

configuring a set of video programming segments selected from the group of video 
programming segments comprising advertising, entertainment, news, weather, financial, sports, 
30 educational, and shopping programming. 
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52. A system to deliver customized linear video programming to each of a plurality of : 
individual viewers, comprising: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to develop viewer characteristics information for each of the viewers; and 
5 a programmer to configure a set of video programming segments for each viewer, at least 

one of the video programming segments selected from a plurality of available video 

programming segments, to create an apparently linear program for linear delivery to the viewer 1 
in accordance with the viewer characteristics information. 

10 53 . The system of claim 52, wherein the programmer further comprises: 

a programmer to select at least one broadcast video programming segment for linear I 
delivery to the viewer concurrent with the broadcast of the video segment to create the 



30 



58. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television programs not watched by each 
of the plurality of viewers. 

- 69 - 
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apparently linear program. | 

1 5 54. The system of claim 52, wherein the programmer further comprises: 

a programmer to select at least one video programming segment stored on a storage 
medium for linear delivery to the viewer to create the apparently linear program. 

55. The system of claim 52, wherein the processor comprises: 
20 a processor to process information indicative of television program viewing preferences 

of each of the plurality of viewers. 

56. The system of claim 55, wherein the processor comprises: 
a processor to process information indicative of television programs watched by each of 

25 the plurality of viewers. 

57. The system of claim 55, wherein the processor comprises: 
a processor to process information indicative of television programs recorded by each of 

the plurality of viewers. 



BNSDOCID: <WO 01 17250A1_IA> 



WO 01/017250 



PCTAJSOO/23868 



59. The system of claim 55, wherein the processor comprises: 

a processor to process information indicative of television program guide information 
requested by each of the plurality of viewers. 

.t 
% 

5 60. The system of claim 55, wherein the processor comprises: i 
a processor to process information indicative of television program guide information not ? 
requested by each of the plurality of viewers. | 

■ ' ' I 

61 . The system of claims 56, 57, 58, 59, or 60, wherein the processor comprises: J 
10 a processor to process electronic program guide information. f ■ 



15 



62. The system of claim 52, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers provided by each of the viewers in response to queries. 

63. The system of claim 52, wherein the processor comprises: 

a processor to process information indicative of preferences other than television 
program viewing preferences of each of the plurality of viewers. 



20 64. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of musical preferences of each of the 
plurality of viewers. 

65. The system of claim 63, wherein the processor comprises: 

25 a processor to process information indicative of reading preferences of each of the 

plurality of viewers. 

66. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of shopping preferences of each of the 
30 plurality of viewers. 

67. The system of claim 63, wherein the processor comprises: 

a processor to process information indicative of preferences other than television 

program viewing preferences of each of the plurality of viewers acquired from the group of 
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sources comprising on-line music clubs, on-line book clubs, on-line special interest clubs and 
organizations, and on-line retailers and merchants. 

68. The system of claim 52, 55, 62, or 63, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to develop a television program viewing preference profile for each of the viewers. 

69. The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers in accordance with a predictive model based on the viewing habits of a representative 
sample of the general population to develop a television program viewing preference profile for 
each of the viewers. 

70. The system of claim 69, wherein the processor further comprises: 

a Bayesian network to calculate maximum a posteriori values for the parameters of the 
predictive model to predict television program viewing preferences for each viewer. 

7 1 . The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to deduce hidden traits of each viewer. 



72. The system of claim 68, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
viewers to deduce associated traits of each viewer. 

73. The system of claim 68, further comprising: 

a processor to process the television program viewing preference profile developed for 
each of the viewers to develop demographic information for each of the viewers. 

74. The system of claim 73, wherein the processor comprises: 

a processor to process the television program viewing preference profile developed for 
each of the viewers in accordance with a predictive model based on the viewing habits of a 
representative sample of the general population to develop demographic information for each of 
the viewers. 
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75. The system of claim 74, wherein the processor comprises: 
a processor to process the television program viewing preference profile developed for 

each of the viewers in accordance with a predictive model that predicts each demographic trait 
5 of the viewer based on viewer preferences for television programs that attract a higher 

proportion of viewers exhibiting the demographic trait than is exhibited by the representative 
sample of the general population. 

76. The system of claim 75, wherein the processor comprises: 
10 a processor to process the television program viewing preference profile developed for 

each of the viewers in accordance with a predictive model that predicts each demographic trait 
of the viewer based on viewer preferences for television programs that attract a higher 
proportion of viewers exhibiting the demographic trait than is exhibited by the representative 
sample of the general population and that exhibit minimal demographic trait correlation with 
15 other television programs that attract a higher proportion of viewers exhibiting other 

demographic traits than is exhibited by the representative sample of the general population. 

77. The system of claim 52, wherein the processor comprises: 
a processor to process information indicative of preferences of each of the plurality of 

20 viewers ^demographic information for each of the viewers. 

develop 

78. The system of claim 77, wherein the processor comprises: 
a processor to process information indicative of preferences of each of the viewers in 

accordance with a predictive model based on the viewing habits of a representative sample of 
25 the general population to develop demographic information for each of the viewers. 

79. The system of claim 78, wherein the processor comprises: 
a processor to process information indicative of preferences of each of the viewers in 

f 

accordance with a predictive model that predicts each demographic trait of the viewer based on 
30 viewer preferences for television programs that attract a higher proportion of viewers exhibiting 
the demographic trait than is exhibited by the representative sample of the general population. 

80. The system of claim 79, wherein the processor comprises: 
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a processor to process information indicative of preferences of each of the viewers in 
accordance with a predictive model that predicts each demographic trait of the viewer based on 
viewer preferences for television programs that attract a higher proportion of viewers exhibiting 
the demographic trait than is exhibited by the representative sample of the general population 
5 and that exhibit minimal demographic trait correlation with other television programs that attract 
a higher proportion of viewers exhibiting other demographic traits than is exhibited by the 
representative sample of the general population. 

8 1 . The system of claim 77, wherein the processor comprises: 

1 0 a processor to develop a television program viewing preference profile for each of the 

viewers in accordance with the viewer demographic information. 

82. The system of claim 68 or 8 1 , wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
1 5 viewers to develop a plurality of television program viewing preference profiles for each of the 
viewers. 

83. The system of claim 82, wherein the processor comprises: 

a processor to process information indicative of preferences of each of the plurality of 
20 viewers to develop a plurality of television program viewing preference profiles for each of the 
viewers, each profile describing the television program viewing preferences of the viewer at a 
different time of day, time of week, or season. 

84. The system of claim 68 or 81* wherein the processor comprises: 

25 a processor to process information indicative of preferences of each of a plurality of 

viewers accessing the same television equipment to develop a plurality of television program 
viewing preference profiles for each of the viewers. 



85. The system of claim 84, wherein the programmer comprises: 
30 a programmer to configure a set of video programming segments for the viewers 

accessing the same television equipment, at least one of the video programming segments 
selected from a plurality of available video programming segments, to create an apparently 
linear program for linear delivery to the viewers in accordance with characteristics information 
of the viewers. 
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86. The system of claims 68 or 8 1 , wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
one of the video programming segments selected from a plurality of available video 
5 programming segments, for recording in accordance with the television program viewing 
preference profile of the viewer. 



87. The system of claim 52, wherein the programmer comprises: 

a programmer to configure a set of video programming segments selected from the group 
10 of video programming segments comprising advertising, entertainment, news, weather, 
financial, sports, educational, and shopping programming. 

88. The system of claim 87, wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
15 one of the video programming segments selected from a plurality of available video 

programming segments, for recording in accordance with the viewer characteristics information. 



89. The system of claim 87, wherein the programmer comprises: 

a programmer to configure a set of video programming segments for each viewer, at least 
20 one of the video programming segments selected from a plurality of available video 

programming segments, to create an apparently linear program for linear delivery to the viewer 
on at least one dedicated channel in accordance with the viewer characteristics information. 

90. The system of claim 87, wherein the programmer comprises: 

25 a programmer to present a listing of the set of video programming segments to the user 

for the user to select therebetween. 

91 . The system of claim 52, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
30 of available video programming segments to create an apparently linear program for linear 

delivery to the viewer, the program exhibiting content customized in accordance with the viewer 
characteristics information. 

92. The system of claim 52, wherein the programmer comprises: 
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a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer characteristics 
information. 

5 

93. The system of claim 92, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer characteristics 
1 0 information by the providers of the video programming segments. 

94. The system of claim 68, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
1 5 delivery to the viewer, the program exhibiting content customized in accordance with the viewer 
television program viewing preference profile. 

95. The system of claim 68, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
20 of available video programming segments to create an apparently linear program for linear 

delivery to the viewer, the program exhibiting content targeted to the viewer television program 
viewing preference profile. 

96. The system of claim 95, wherein the programmer comprises: 

25 a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer television program 
viewing preference profile by the providers of the video programming segments. 

30 97. The system of claim 96, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available advertising video programming segments to create an apparently linear program for 
linear delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
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television program viewing preference profile by the providers of the advertising video 
programming segments. 

98. The system of claim 77, wherein the programmer comprises: 
5 a programmer to select one or more of the video programming segments from a plurality 

of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content customized in accordance with the viewer 
demographic information. 

10 99. The system of claim 77, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer demographic 
information. 

15 

100. The system of claim 99, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available video programming segments to create an apparently linear program for linear 
delivery to the viewer, the program exhibiting content targeted to the viewer demographic 
20 information by the providers of the video programming segments. 

101 . The system of claim 100, wherein the programmer comprises: 

a programmer to select one or more of the video programming segments from a plurality 
of available advertising video programming segments to create an apparently linear program for 
25 linear delivery to the viewer, the program exhibiting advertising content targeted to the viewer 
demographic information by the providers of the advertising video programming segments. 

102. The system of claims 91, 92, 94, or 98, wherein the programmer comprises: 

a programmer to configure a set of video programming segments selected from the group 
30 of video programming segments comprising advertising, entertainment, news, weather, 
financial, sports, educational, and shopping programming. 
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